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OXIDOREDUCTASE GENE ASSOCIATED WITH THE FRAl 6D FRAGILE SITE 
FIELD OF THE INVENTION 

This invention relates to the field of cancers and in particular to nucleotide sequences of the 
5 fragile site FRA16D, of the FOR gene and amino acid sequences of its encoded proteins, as 
well as derivatives and analogs thereof and agents capable of binding thereto, and uses of 
these, such as in diagnosis and therapy. 

BACKGROUND OF THE INVENTION 

10 Cancers are a significant factor in mortality and morbidity, with onset rates of forms of cancer 
being quite high in all places of the world. Early detection greatly improves the chances of 
remission and considerably reduces the chance of the cancer metastasizing. The treatment of 
early stage cancers is also much more benign so that there are less severe residual effects 
resulting from the treatment. Accordingly early detection of cancers is a high priority in 

1 5 management of the diseases. Similarly treatment of various cancers are of mixed outcome and 
it is desirable to provide for alternative treatments at least for certain forms of cancers. 

Cancers are of many different types and severity, however the uncontrolled proliferation of 
cancers cells is invariably associated with damaged DNA of one form or another. Some types 
20 of cancer are familial in the sense that there is an mcreased risk of contracting cancer, but the 
hereditary characteristics in most cancers are not simple and there is only usually a few fold 
increased risk among family members as compared to the general population. The DNA 
damage in most cancers are associated with somatic mutations the acquisition of which is 
thought to be associated with exposure to certain environmental factors. 

25 

A very large number of genes have been identified as being associated with the onset of cancer 
and this reflects the complexity of the regulation of normal cellular proliferation. These genes 
can be categorised into three groups the first of which includes the so called oncogenes or 
protooncogenes which are often associated with positive control elements, enhancing cellular 
3 0 proliferation in the normal cellular cycle. Certain mutations in these positive control elements 
trigger uncontrolled proliferation. A second group are the so called tumour suppressor genes, 
which are genes that normally suppress proliferation, and inactivation or reduction in activity 
of these leads to abnormal proliferation. These tend to act in a recessive fashion. A third 
group are the so-called mutator genes which are normally responsible for maintaining genome 
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integrity during the proliferative cycle, and if these are defective then the general mutation rate 
increases and the consequent chance of providing for a transforming mutation increases. 

One mapping technique to locate the site of chromosomal lesion in a cancer cell is known as 
5 the loss of heterozygosity (LOH) technique. Eukaryotes have two copies of each 

chromosome, apart from the sex chromosomes, and as a result cancers that result from 
mutations in a tumour suppressor generally require two mutations. Sometimes one mutation 
will be inherited, and a second mutation is required to trigger the cancer leading to loss of 
function of both copies of the gene in the individual. Quite often these secondary mutations 
10 will be deletions and their location can be detected by checking the presence of highly 

polymorphic genetic markers from the tumour tissue and from another site such as blood. The 
markers that are heterozygous in normal tissue and have become homozygous in the cancer 
tissue can give an indication of the lesion concerned. 

15 The LOH technique is however quite difficult to routinely perform and inteipret reliably, this is 
particularly so because any tumour sample usually is also contaminated by non-tumour tissue, 
and it is at times difficult to distinguish a result because of a decreased relative intensity, and 
quantitative amplification techniques will often need to be employed. Another limitation relates 
to the availabihty of a suitably dense array of markers which generally leads to the detection 

2 0 only of larger deletions. A single tumour may have LOH in many distinct regions, but LOH 
will only be detected in those regions that have been tested. The LOH technique is thus 
unsuited to diagnostic purposes. 

The use of these LOH studies have identified a number of sites some of which correspond to ■ 
25 regions of the chromosome termed fragile sites. 

Fragile sites appear as breaks, gaps or decondensations on metaphase chromosomes. These 
non-random breaks appear in defined locations on human chomosomes under appropriate 
conditions. 

30 

There are two distinct forms of chromosomal anomaly referred to as fragile sites (Sutherland et 
aL, 1998)). The *rare' form is polymorphic in the population and is accounted for by the 
expansion of repeat DNA sequences beyond a copy number limit. The 'common* form is 
present at many loci in all individuals. Despite determination of the complete sequence 
35 analysis of the common fragile site, FRA3B (Boldog et aL, 1996; Inoue et aL, 1997; Mimori 
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et al, 1999) and the partial sequence analysis of the common fragile sites, FRA7G and 
FRA7H (Huang et aL, 1998a,b; Mishmar et aL, 1998) the molecular basis for common fragile 
sites is not yet understood. 

5 Fragile sites are also distinguished by the culture conditions required for their induction. 
Common fragile sites are (mainly) induced by aphidicolin, whereas the rare fragile sites are 
induced by either high or low concentrations of folate or the AT-rich binding chemicals such 
as distamycin A or by bromodeoxyuridine. The role of chromosomal fragile sites in human 
genetic disease was thought to be restricted to fragile X syndrome caused by the FRAXA 
10 fragile site, however a mild form of mental retardation has been associated with FRAXE and 
the FRAllB fragile site appears to predispose to 1 Iq breakage leading to some cases of 
Jacobsen syndrome. 

Fragile sites have been proposed to have a determining role in cancer associated chromosomal 
15 instability. There are in excess of 100 fragile sites in the human genome of which the fragile 
site FRAllB is located within the CBL2 proto-oncogene (Jones et aL, 1994, 1995) and the 
FRA3B, FRA7G and FRA16D sites have been located within or adjacent to regions of 
instability in cancer cells (Ohta et al, 1996; Sozzi et al., 1996; Engelman et aL, 1998; Huang 
et ai, 1998a,b; Chen et aL, 1996; Latil et ai, 1997). 

20 

Recent detailed molecular analysis of fragile site loci has demonstrated that the common fragile 
site FRA3B is located within a region subject to localised deletion and that this deletion is 
frequently observed in certain forms of cancer (Ohta et al, 1996; Sozza et al, 1996). FRA3B 
lies proximal to the major region of LOH on chromosome 3p previously shown to be 
25 responsible for deletion of the VHL tumour suppressor (Gnarra et aL^ 1994). The cancer- 
associated FRA3B deletions can result in inactivation of a gene (FHTT -Fragile Histidine 
Triad) which spans the fragile site (Croce et al US patent 5928884). The FHITgcnc product 
has been shown to have a role in tumour growth (Siprashvilli et aL, 1997) but quite what the 
significance or nature of that role is subject of active research at the present. 

30 

Another common fragile site FRA 7G has also been shown to be located within an about 1Mb 
region of frequent deletion in breast and prostate cancer (18,19) as well as squamous cell 
carcinomas of the head and neck, renal cell carcinomas, ovarian adenocarcinomas and colon 
carcinomas (20). The human caveolin-1 and -2 genes are located within the same conunonly 
35 deleted region as FRA 7G, Caveolin-1 has been shown to have a role in the anchorage 
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dependent inhibition of growth in NIH 3T3 cells (21). The caveolins are therefore candidates 
for the tumour suppressor gene presumed to be located in the FRA 7G region (20). 

Another common fragile site which is aphidicolin inducible is the FRA16D site. FRAI6D has 
5 been localised at I6q23.2 within a large overlapping region of chromosomal instability in 
breast and prostate cancer as defined by loss-of-heterozygosity (24,25). One study has found 
that a significant proportion (77%) of breast cancers carries a deletion at 16q23.2, including 
the marker D16S518 in the inmiediate vicinity of FRA16D (24). 

10 There has been no characterisation of a nucleic acid or protein associated with the FRA16D site 
and the physical location of FRAI6D has not yet been determined. Such a characterisation is 
desirable to enable potentially early diagnosis and assessment of risk as well as potentially 
providing for a therapeutic treatment. 

1 5 SUMMARY OF THE INVENTION 

The inventors have produced a detailed physical map of the FRA16D region which provides 
markers to identify a relationship between this fragile site and DNA instability in neoplasia and 
which, further, may allow better diagnosis of cancers associated with the region. This 
analysis reveals the existence of an intimate relationship between the location of FRA16D and 

2 0 homozygous deletions in various tumours, culminating in the coincidence of two tumour cell 
DNA breakpoints with the most likely position of the fragile site. 

The inventors have also characterised the nucleic acid associated with FRA16D especially by 
nucleic acid sequencing. Analysis of the DNA sequence and EST sequences associated with 
25 the region has identified a number of introns and exons which are found to exist in at least four 
different splice variants of what will be termed protein FOR. RNA analysis has also been 
conducted and thus far at least four species of mRNA associated with the region have been 
detected. 

30 In a first aspect the invention could be said to reside in a method of detecting genetic variations 
of a I6q23.2 target in the 16q23.2 region of the chromosome, said method comprising the 
steps of contacting target nucleic acid with one or more oligonucleotides suitable for use as 
hybridisation probe or PCR priming specific for binding the I6q23.2 specific target, and 
ascertaining the binding of said oligonucleotide. 



35 
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It will be understood from the specification that the 16q23.2 specific target might be selected to 
be within the group comprising the FOR gene, the FRA16D site, or mRNA encoding FOR 
protein or two or more of these collectively. The target may include chromosomal 
rearrangements and mutations thereof and the rearrangements or mutations may, in one form, 
5 be cancer associated. The variations may include markers in the region such as set forth in this 
specification including in figures 1, 2 and 6. 

The 16q23.2 target within the FOR gene might be selected from one or more of the group 
comprising exons lA, 1, 2, 3, 4, 5, 6, 6A, 7, 8, 9, 9A, 10. lOA, lOB or exons located 
10 between two adjacent exons or control elements in other adjacent regions that effect an altered 
expression of the FOR gene. Such adjacent regions may have a promoter, enhancer elements 
or other regulatory elements. The target may be any one of the splice variants currently 
identified as FOR T, FOR II. FOR Til or FOR IV or it nught include other combinations of two 
or more of the exons. 

15 

It is noted in particular that breakpoints of three out of five I6q23.2 translocations associated 
with multiple myeloma map within the alternate splice of this FOR intron, that is, between 
exons 8 and 9A. and in one form a preferred target is the intron between exons 8 and 9A or a 
portion thereof. 

20 

In some circumstances the method might be used to detect any rearrangements in a larger target 
area. Thus it might be desired to use a plurality of oligonucleotides which might be selected to 
bind to a range of target binding sites within the I6q23.2 specific target to detect for a range of 
changes. This might be used for example to detect for chromosomal rearrangements such as 

25 ' deletions within the FRA16D site or beyond that in the broader 16q23.2 region. The plurality 
of oligonucleotides or a plurality of specific binding sites of the I6q23.2 target are preferably 
spacially separated so that binding of each of the plurality of oligonucleotides or binding to the 
plurality of specific binding sites can be separately ascertained. The spacial separation might, 
for example, be conveniently provided as an array on a solid support, for example in a form 

3 0 that is common referred to as a gene chip (see for example patent specifications US 52885 14 
and US 5593839). Instead of a plurality of oligonucleotides it may be desired that the target 
be probed by a single oligonucleotide. 

Alternatively the target area might be small, thus for example the method might be used to 
35 ascertain the presence or absence of a particular mutation or allelic variation in the I6q23.2 
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target. Thus for example a target of the 6A, 1 A, 9 or 10 or 9 A exon will distinguish between 
FOR I, FOR IV, FOR IT and FOR III transcription variants. These may also be used to 
quantify differences in expression of the splice variants FORII and FORI on the one hand and 
FORII on the other. It might be expected that because the FORIII only has the WW domains 
5 in contrast to FOR II and FOR I a significant biological effect may result from variations in the 
balance of expression of these different variations of FOR, such variations may give an 
indication of individuals who are at risk of contracting a form of tumoun A small target area 
might also be adequate for use with gross chromosomal rearrangements in so far as this might 
be used to determine the presence or absence of junctions of known chromosomal 
10 rearrangements, or alternatively the binding or non binding of one or more of a plurality of 
oligonucleotides. The target area might also be selected to allow for assessment of the 
presence or absence of cancer associated point mutations or small DNA rearrangements, using 
suitably selected oligonucleotides. 

15 The base sequence of the oligonucleotide chosen will depend upon several factors known in 
the art. Primarily the sequence of the oligonucleotide will be determined by its capacity to bind 
to the target nucleic acid sequence. The nature of the sequence will depend to some extent on 
the stringency of the hybridisation required, and whether or not it is desired for one 
oligonucleotide to detect variation in sequence or not. If variation in one nucleotide is required 

2 0 the stringency of the hybridisation will be high. The length of the oligonucleotide will also be 
determined by the stringency of the reaction required. 

The binding might be by in situ hybridisation of a chromosomal spread, or other suitable 
spacial arrangement of the target region such as for example on a so called gene chip. Such 

2 5 hybridisation methods will generally provide for an oligonucleotide and be capable of binding 

the target over a span of at least 15 nucleotides. In the case of hybridisation techniques the 
oligonucleotides will generally carry a label which can be detected by known measuring 
methods, especially when bound to the 16q23.2 target. Such labels might include radiolabels 
such as or a fluorescent marker. 

30 

The method might require a preamplification step whereby the target nucleic acid is amplified, 
to make it easier to ascertain the binding or non binding of the nucleic acid to the target site. 

On the other hand the oligonucleotide might be suitable for amplification of a segment of the 

3 5 target nucleic acid such as by PGR. in which case the size of the target may be somewhat 
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different. With this variation two oligonucleotides might be selected, to provide for 
amplification of at least part of the target nucleic acid, at least one of the oligonucleotides is 
required to bind in the target. 

5 The target nucleic acid might be presented in any one of a number of physical forms. Nucleic 
acid from an individual might be isolated and perhaps digested by a restriction enzyme and 
spread out such as by electrophoresis on an agarose or polyacrylamide gel, so that binding of 
the oligonucleotide can be effected whilst the target nucleic acid is supported by the gel or this 
might be supported on other solid medium such as a gene chip or a metaphase chromosomal 
10 spread. Alternatively the oligonucleotide or oligonucleotides might be fixed, and the target 
nucleic acid might either be diminished in size, or not, and then binding of fragmented targets 
to the fixed oligonucleotide determined. 

The target nucleic acid niight be in the form of chromosomal DNA. or might be cDNA or 
15 mRNA. 

This method might also be used to detect other variants, homologs or analogs of the FRA16D 
site, FOR gene, or other nucleic acid sequences disclosed in this specification. Thus it might 
be, for example desirable to determine analagous gene in livestock, domestic, laboratory or 
2 0 sporting animals. Alternatively one might wish to determine another analogous protein that 
plays a similar role in humans. 

In a second aspect the invention relates to a method of detecting the number of alleles for one 
or more markers in the 16q23.2 target, and this may be a means of perhaps providing a 

2 5 measure of the loss of heterozygosity in an individual. This aspect of the invention therefore 
relates to locating a deletion that overlaps with the HiAl6D region. The method might be 
achieved by providing a first set of one or more oligonucleotides and a second set of one or 
more oligonucleotides the first set of oligonucleotide being specific for a first variant of the 
target nucleic acid, the second set of oligonucleotides being specific for a second variant of the 

30 target nucleic acid, the fu-st and second set of oligonucleotides being labelled so as to be 
capable of being distinguished, and the method comprising the steps of comparing the 
proportion of binding of the first and second set of oligonucleotides. A method of this sort is 
set forth in US patent specification 5928870 to Lapidus et al, which for purposes of practicing 
the invention is incorporated herein by reference. 



35 
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It will be understood that the above method is useful in categorising the risk of contracting 
certain types of cancer associated with the FRA16D fragile site or other portion of the 16q23.2 
region. 

5 In a third aspect the invention could be said to reside in a method of determining the level of 
expression of the FOR gene or any one or more exon thereof, by determining the level of 
mRNA expression using a probe specific for the FOR gene or exon thereof. This might be 
used to determine the dysregulation of FOR expression. It will be understood that it may be 
desired to also determine the level of expression of variants of the gene or exons including 
10 rearrangements and mutants including those associated with cancers. This is likely to give a 
prognosis in relation to at least certain cancers that are currently contracted or perhaps an 
indication of the risk of contracting one or more types of cancer. 

In a fourth aspect the invention could be said to reside in an isolated nucleic acid molecule 
1 5 selected from the group comprising 

a) any one or more of the nucleic acids sequences disclosed in the figures hereto 
or parts thereof 

b) FRA16Dsite 

c) FOR gene, or exons thereof 
20 d) mRNA of the FOR gene 

e) cDN A of the FOR gene 

f) variants of the above including, chromosomal rearrangements and mutations of 
sequences set out in a) to e) including those variants associated with cancers 

g) nucleic acid sequence capable of hybridising specifically to any sequence of a 
25 to e above or its complement, and especially those capable of doing so under 

stringent conditions. 

The nucleic acid molecule might include a mosaic from within the above molecules such as a 
combination of two or more of the group comprising the following, exon lA, 1, 2, 3, 4, 5, 6, 

30 6A, 7, 8, 9. 9A, 10, lOA, lOB or introns located between two adjacent exons or control 
elements in other adjacent regions that effect an altered expression of FOR, and it will be 
understood that such a mosaic includes a molecule encoding cDNA of variants of the FOR 
protein, whether a wild type allele, a mutated version, or otherwise rearranged. It will thus be 
understood that the invention includes antisense molecules to any regions of control that might 

35 be contemplated above. Such antisense molecules may be used to vary the expression of such 
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protein as are produced by the FOR gene or perhaps adjacent genes such as the c-MAF gene. 
One may also wish to reduce the expression of one of the splice variants of FOR to provide 
treatment of a given condition, thus for example it might be desired to have antisense 
specifically to FOR HI if FOR in is overexpressed in the condition. 

5 

It will be understood that such nucleic acids include portions of nucleic acids that are suitable 
for use as primers or probes. 

The invention may also be said to include nucleic acids encoding a tumour associated gene 
10 from a human or animal capable of hybridizing with any nucleic acid of the fourth aspect of the 
invention. 



In a fifth aspect the invention could be said to reside in a recombinant vector including one or 
more nucleic acid sequences as set out above, and preferably operably linked to a control 
15 element such as might include a functional promoter. The recombinant vector might be used 
as an expression vector to produce or overproduce FOR protein or variants thereof, or perhaps 
overproduce nucleic acids associated with the FOR gene such as an antisense molecule. 
Suitable vectors are generally available commercially or may be constructed as described 
elsewhere or as is known in the art. 

20 

In a sixth aspect the Invention could be said to reside in an isolated protein molecule, the 
protein molecule being selected from the group comprising the following: 

a) a FOR protein, or 

b) a mutant or variant FOR protein which might optionally be associated with a 
25 cancer 

In a sevendi aspect the invention could be said to reside in a polypeptide produced by any two 
or more exons selected from the group comprising 1 A, 1, 2, 3, 4, 5, 6, 6A, 7, 8, 9, 9A, 10, 
lOA, lOB joined, said exons being either complete exons or partial, and may be variants. 



30 



The invention might also encompass a purified cancer associated protein including a string of 
amino acids unique to a FOR protein and more particularly as set out in figure 9, preferably 
said amino acid string being at least 10 amino acids long and exhibiting at least 70% amino 
acid homology more preferably at least 90% homology. 



35 
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The protein may have an oxidoreductase domain and/or one or more WW domains or may 
have a role in DNA replication of chromosomal division. 

In another form the purified cancer associated protein includes an amino acid string with an 
5 amino acid sequence homology of greater than 70% but more preferably greater than 90% with 
an amino acid string selected from the group comprising: 

TGANSGIGFETAKSFALHGAHVILACR (SEQ ID No 1), 

LHVLVCNAATFALPWSLTKDGLEITFQVNHLGHFYLVQLLQDVL (SEQ ID No 
2), 

10 YNRSKLCNILFSNELHRRLSPRGVTSNAVHPG (SEQ ID No 3) 

In another form the purified cancer associated protein includes a WW domain having an amino 
acid string of 10 amino acid or greater or preferably 20 amino acids or greated with an amino 
acid sequence homology of greater than 70% but preferably greater than 90% with an amino 
15 sequence selected from the group comprising the region 16 to 49 or 57 to 90 of the FOR gene 
(as graphically illustrated in Figure lOA), being the amino acid strings 

DELPPGWEERTTKDGWVYYANHTEEKTQWEHPKT (SEQ ID No 4)and 
GDLPYGWEQETDENGQVFFVDHINKRTTYLDPRL (SEQ ID No 5) 

20 In another form the purified cancer associated protein includes at least one oxidoreductase 
domain having an amino acid string of 10 amino acid or greater or preferably 20 amino acids 
or greater with an amino acid sequence homology of greater than 70% but preferably greater 
than 90% with an amino sequence selected from the group comprising the region 130 to 156 
or 204 to 247 or 293 to 324 of the FOR gene (as graphically illustrated in Figure lOA). 

25 

In an eighth aspect the invention includes an agent capable of selectively binding a FOR 
protein or fragment or variant thereof. Such agents may be particularly useful in diagnostic 
methods. Such an agent may also be used to bind a protein containing a string of amino acids 
unique to FOR or variant thereof and in particular such variants that are currently known to be 

30 associated with one or more forms of cancer. The agent may selectively bind to the variant 
FOR as compared to an FOR protein not associated with cancer. Such an agent niight be an 
agonist or an antagonist of FOR function. It might therefore be desired to provide for a 
number of agents each capable of selectively binding to a separate one of a number of variants 
of FOR so that it is possible to distinguish between variants. Thus for example it might be 

3 5 desired to target the C terminus of respectively FOR I, FOR 11, FOR HI and FOR IV to 
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progenitor cells which include incorporated therein a vector capable of producing an 
appropriate form of FOR protein. Accordingly in a ninth aspect the invention could be said to 
reside in a recombinant host cell having stably inserted therein DNA of any one of the forms of 
DNA contemplated in the third aspect of the invention. In preference the DNA is capable of 
5 producing a tumour suppressing form of FOR, and most conveniently this will be a wild-type 
form of FOR, which may simply be a cDNA molecule or the FOR gene. Alternatively 
however it may also be desired to have a host cell which has a DNA sequence capable of 
producing an antisense molecule in the case where a tumour promoting form of the FOR 
molecule is produced by the. individual to be treated, the antisense capable of reducing the level 
10 of expression of the FOR molecule. 

Methods of gene therapy are not limited to cases where the appropriate nucleic acid is delivered 
in a host cell, but also includes the administration of the nucleic acid specifically to the site of 
interest. 

15 

The recombinant host cell may not necessarily be used for therapeutic purposes, it may also be 
used for over-expression of the protein, or a nucleic acid associated with FOR, or the 16q23.2 
region, and may therefore be bacterial, yeast, plant, animal, preferably mammalian or human. 

20 Additionally the invention contemplates the provision of a transgenic non-human animal 

carrying recombinantly altered or overexpressing 16q23.2 DNA, preferably FRA16D or FOR 
gene, or other DNA of the fourth form of this invention. The recombinant DNA might be 
incorporated into the chromosome of the host, alternatively the host cell may carry said 
recombinant DNA in a self replicating element such as a plasmid. 

25 

The agents of the eighth aspect may be used for ascertaining the level of expression of FOR, 
variants or exons thereof, to determine whether there is an altered level of expression. Thus a 
western blot using a labelled agent may be used for the purpose using known techniques. 
This is another means of measuring dysregulation of expression. 

30 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 : Positional cloning of FRA I6D and location of loss of heterozygosity and 
translocation in cancer. 
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A. The locations of loss-of-heterozygosity regions in breast and prostate cancer 
and the approximate location of the FRA16D fragile site are indicated with 
respect to genetic markers (downward arrows) in the 16q23.2 region. Markers 
in the vicinity of FRA16D are shaded. The approximate location as determined 
by Chesi et aL{l) of multiple myeloma breakpoints and the c-MAF gene, (bar) 
are also shown by upward black arrows. Not to scale. 

B. Map of the contig of YAC subclones across the FRA16D region with 
respect to genetic markers and FRA16D, Open boxes indicate those YACs 
which map by fluorescence in situ hybridisation proximal to FRA16D, grey 
boxes are those which span FRA16D and black boxes indicate those YACs 
which map distal to FRA16D. Not to scale. 

Positional cloning of FRA16D and the extent of heterozygous and homozygous 
deletion in the AGS tumour cell line. 

A. Pulsed-Field gel map of -1Mb of the 'Right Hand Side' (RHS) of YAC 
My801B6 and the location of BACs, genetic and STS markers (key markers 
are boxed). Restriction sites between Afma336yg9 and WI2755 are shown in 

B. The AGS stomach cancer cell line homozygous deletion is indicated - 
shaded circles denote the presence and open circles the absence of PCR 
products for the STS markers. Maximal region of heterozygous deletion in 
AGS cell line is indicated by polymorphic D16S518 and D16S3029 PCR 
products, indicated as A and B alleles. The two AGS cell line chromosome 
16s are indicated by shaded bars. 

B. Restriction map of the critical FRA16D region (Afma336yg9 to D1653029) 
showing the location of key members of the lambda subclone tile path used for 
FISH in figure 3. Clones designated 1-n are from 325M3; others are from 
801B6. Open boxes represent those subclones found to map proximal (on the 
basis that >85% of their FISH signals were proximal to FRAJ6D), grey boxes 
those which appear to span the fragile site (less than 85% on one side or other 
ofFRA16D) and black boxes those which are distal to the fragile site (on the 
basis that >85% of their FISH signals were distal to FRA16D), I clones which 
gave high background on FISH were not scored. These and other 1 clones for 
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which FISH data were not obtained are included as thin boxes. STS 
localisation of the AGS homozygous breakpoints are indicated by the presence 
(shaded circles) and absence (open circles) of PGR products. 

5 Figure 3: Fluorescence in situ hybridisation (FISH) of lambda subclones against 
FRA16D expressing chromosomes. 



Each panel contains two FRA16D expressing partial metaphases, with and 
without FISH signal merged. In each case the width of the gap or break at the 

10 fragile site is greater than the width of the chromatid, (a) 1504 showing signal 

proximal to FRA16D; (b) 1 181 showing signal proximal and distal to FRA16D; 
(c) 1191 (upper) and 18 (lower) showing signal distal to FRA16D. Images of 
metaphase preparations were captured by a cooled CGD camera using the 
GhromoScan image collection and enhancement system (Applied Imaging 

15 International Ltd.). FISH signals and the DAPI banding pattern were merged 

for figure preparation. 

Figure 4: Fluorescence in situ hybridisation mapping of the lambda subclone tile path 
across FRA16D. 

20 

The individual lambda clones were scored against chromosomes where the 
FRA16D gap or break was greater than the chromatid width. Each increment 
represents a single FISH signal, n = number of chromosomes scored. Scores 
were plotted as proximal (p) and distal (d) with respect to FRA16D, Maximum 

2 5 location for FRA 16Ds indicated by arrows. Location of BAG clones 325M3 

and 353B15 is also shown. The boxed lambda contig subclones indicate those 
for which FISH signal results with respect to the FRA16D fragile site were 
obtained - open boxes, had >85% signal proximal to FRA16D; grey boxes, 
spanning (<85% signal on one side or other of FRA16D) and black boxes, had 

30 >85% signal distal to FRAI6D, While this figure is not to scale the location of 

the lambda clones can be determined from their position in figure 2. Thin 
boxed lambda clones are those for which FISH data was not obtained. 



Figure 5: Duplex PGR deletion detection at the FRA16D locus in tumour cell lines. 

35 
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PGR products from the duplex of STSG- 10102 and dystrophin DMD Pm were 
subjected to agarose gel electrophoresis and ethidium bromide staining. 
Template DNAs were seven tumour cell lines and blood bank and no DNA 
controls. Markers are Hpall digested pUC19. The position of the STSG-10102 
5 and DMD Pm PGR products are indicated by large grey-filled arrows while the 

primer dimer PGR artefact is indicated by a small white arrow. 

Figure 6: A. Extent of loss of heterozygosity regions in breast (25) and prostate cancer 
(24) in relation to the cytogenetic position of the FRA16D fragile site as 
10 determined by fluorescence in situ hybridisation of a tile path of subclones as 

show in figure 4. 

B. Map of YAGs which span FRA16D region showing approximate location of 
multiple myeloma breakpoints (MM.l, ANBL6, JJN3) determined by Ghesi et 
15 a/., (1). Location of homozygously deleted regions in AGS and HGTl 16 

tumour cell lines as determined by STS content. The locations of various 
partial BAG sequences (as evident by STS content) are indicated. Striped 
boxes = determined sequence accession numbers. 

20 G. The location of the FRA16D spanning DNA sequence and the respective 

exons of the alternative spliced FOR gene transcripts (numbered black boxes). 
Glusters of ESTs sequences representative of each of the alternative mRNA 3' 
ends are given. 

25 Figure 7: A. Northern blots of RNA from various human tissues. Expected FOR 

mRNAs (I-IV) are indicated for the respective DNA probes which span various 
exons of the FOR gene. H, heart» Br, brain; PI, placenta; Lu, lung; Li, liver; 
sM, skeletal muscle; K, kidney; P, pancreas. Arrows indicate FOR mRNAs 
{FOR I approx. 1.3kb. FOR II approx 2.2kb, FOR III approx 0.74 kb) 

30 

B. Northern blots of RNA from various human tissues, spleen, thymus, 
prostate, testis, ovary, small intestine, colon, peripheral blood leukocytes. 
Probes (I, II and HI) and (I and U) are as indicated in Figure 6. FOR I FOR II 
and FOR III mRNAs are indicated. Additional transcripts hybridizing to the 
35 FOR probes are indicated by grey arrows. 
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Figure 8 A. Is a composite DNA sequence of the predicted FOR I transcript (SEQ ID No 
28) constructed by conjoining overlapping EST, RT-PCR and 5' RACE DNA 
sequences. 

B. Is a composite DNA sequence of the predicted FOR II transcript (SEQ ID 
No 29) constructed by conjoining overlapping EST, RT-PCR and 5' RACE 
DNA sequences. 



10 C, Is a composite DNA sequence of the predicted FOR III transcript (SEQ ID 

No 30) constructed by conjoining overlapping EST, RT-PCR and 5' RACE 
DNA sequences. 

D. Is a composite DNA sequence of the predicted FOR /V transcript (SEQ ID 
15 No 3 1 ) constructed by conjoining overlapping EST and RT-PCR DNA 

sequences. 

Figure 9 are composite amino acid sequences predicted for the sequences for FOR I 

(SEQ ID No 32), FOR n (SEQ ID No 33), FOR HI (SEQ ID No 34) and FOR 
20 IV (SEQ ID No 35) as shown in figure 8, unique sequences are underlined. 

Figure 10 A. Is a diagrammatic representation of the four FOR amino acid sequences 

showing the locations of the alternate splice sites, the position of the exons, the 
three predicted oxido reductase domains, and the predicted WW domains. The 
2 5 sequence numbers refer to the amino acid sequence. 

B. Is an alignment of the sequences WW domains (SEQ ID No 4 and SEQ ID 
No 5) with each other and with the WW domain consensus sequence. 



30 

Figure 1 1 sets out DNA sequences for each of the exons identified for the FOR protein as 
well as a small amount of flanking intron sequence. The exon sequences are in 
uppercase, while the intron sequence is in lower case. Some nucleotide 
sequences are in bold, splice donor (GT) and acceptor (AG) sites, 
3 5 polyadenylation signals (AATAA) and initiation Methionine (ATG). For exons 
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1 and 1 A an upstream in phase termination codon is in italics and confirms the 
correct open reading frame in these mRNAs. 

Figure 12 is about 270kb of DNA sequence that overlaps and defines within it the 
5 FRA16D fragile site (SEQ ID No 53), which is shown to reside between exons 

8 and 9, this sequence has been deposited in the GenBank database and has 
been assigned accession number AF2 17490 as indicated in figure 6. 

Figure 13 is DNA sequence deposited with GenBank database and identifed by accession 
10 number AF217492 as indicated in figure 6, and which encompasses exon 7 

(SEQ ID No 52). 

Figure 14 is DNA sequence deposited with GenBank database and identifed by accession 
number AF2 17491 as indicated in figure 6, and which encompasses exon 6 
15 (SEQ ID No 51). 

Figure 15 shows FOR transcripts in normal and tumour cells. Products that were 
subjected to sequence analysis are indicated by arrowheads. 

20 A RT-PCR were either 'specific' for the FOR III transcript or 'general* being 

able to detect FOR I-III mRNAs. 



B 5'RACE specific for the FOR I. FOR 11 and FOR IE transcripts in 'normal' 
HS578BST cells and T47D tumour cells. 



25 



DETAILED DESCRIPTION OF THE INVENTION. 



EXAMPLE 1 - MAPPING OF THE FRA16D FRAGILE SITE 

30 

Materials and methods 

Isolation of DNA probes and YACs in the FRA16D region 

Nine DNA probes, ACH202 (D16S14), c311F2, c302A6 (D16S1075), c301F10 (D16S373), 
16-87 (D16S181), C306D2, 16-08 (D16S162), c307A12 and CRI-Ol 19 (D16S50) which had 
35 been physically mapped into the 16q23 region (30) were chosen for fluorescence in situ 
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hybridisation (FISH) against FRA16D expressing chromosomes. Four of these markers 
mapped within the same somatic cell hybrid breakpoint interval defined by the cell lines 
CY113(P) and CY121 (30). One of these, c306D2 mapped proximal to FRA16D by FISH 
while the others. c307A12, CRI-01 19 and 16-08 mapped distal to FRAJ6D, These probes 
5 were therefore used as starting points to isolate a contig of cloned DNA spanning FRA16D, 
In the Los Alamos National Laboratory database (www-ls.lanl.gov) an STS sequence from 
C306D2 was found within the CEPH YACs My903D9, My912D2 and My933H2 while an 
STS in C307A12 was found in My891F3 and My972D3. These YACs were obtained from 
CEPH and the prepared DNA subjected to Pst I digestion, Southern blotted and probed with 

10 16-08, 16-87, CRI-01 19, c306D2 and c307A12 in succession in order to confirm their • 

content. In addition a search of the Whitehead Institute database (www-genome.wi.mit.edu) 
revealed that the two sets of YACs were joined into a contig by the YACs My801B6, 
My845D9 and My944D8. Each of these YACs was used as template DNA to assess STS 
content (D16S518, Afma336yg9, WI2755, STSG-10102 and D16S3029) and subjected to 

15 FISH to assess position with respect to FRA16P (Figure IB). 

Additional probes, STSs and BACs from the FRA16D region 

Additional probes were generated from the YAC 801B6 by subcloning Pst I digests of YAC 
DNA and screening with total human DNA as probe. These subclones were digested with 

20 Hinc n to identify and isolate non-repetitive DNA fragments as probes. This generated 

markers HI 3m, H22s, H23m, H29m and H40m. Genome System Inc. BAC library filters 
were screened with the probes D16S518, Afma336yg9, WI-2755, STSG-10102, H22s, 
H29M and D16S3029 and nine BAC clones including 379C2, 325M3 and 353B15 were 
identified. An additional STS, named 2AS, was established by 'bubble' PCR from the end- 

2 5 fragment of BAC 353B 1 5 and was isolated as described by Gecz et al (31). Briefly, the BAC 
DNA was digested with Alu I and ligated to the annealed bubble linkers. The final PCR was 
carried out with a combination of Not I-A bubble primer and Sp6-promoter primer as 
described except an annealing temperature of SS'^C was used. These STSs and hybridisation 
probes were used to establish restriction maps of the YAC My801B6 and the BACs (Figure 

30 2 A). 

Subcloning and contig assembly 

The YAC My801B6 and the BAC 325M3 were used as DNA templates for establishing 
lambda subclone libraries in IGEMll or 1GEM12 vectors (Promega) according to the 
35 supplier's protocol. My801B6 and 325M3 appeared to have intact human DNA inserts, based 
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on comparative pulsed field gel mapping of the YACs and BACs across the region (data not 
shown). 

Fluorescence in situ hybridisation 
5 FRAl6D-expressing metaphases were obtained from peripheral blood lymphocytes by 
standard methods. Briefly, cultures were grown for 72 hours in Eagle's minimal essential 
minimal medium, minus folic acid, supplemented with 5% fetal calf serum. Induction of 
FRA16D was with 0.5uM aphidicolin (dissolved in 70% ethanol) added 24 hours before 
harvest (32). DNA clones were nick-translated with biotin-14-dATP, pre-associated with 

10 6ug/ul total human DNA, hybridised at 20ng/ul to metaphase preparations, and detected with 
one or two amplification steps using biotinylated anti-avidin and avidin-FITC as previously 
described (33). Hybridisation signal was visualised using an Olympus AX70 microscope 
fitted with single pass filters for DAPl (for chromosome identification), propidium iodide (as 
counterstain) and FITC. FRA16D-expressing chromosomes were scored for signal only 

1 5 when the width of the fragile site gap was greater than the width of one chromatid, so that 
signal was unambiguously proximal or distal to the gap (Figure 3). Only fluorescent dots 
which touched chromatin were scored as signal - the few fluorescent dots which lay within the 
fragile site gap but did not touch proximal or distal segments were therefore not scored as 
signal since there was a possibility that they comprised non- specific background. Lambda 

20 clones which gave very poor FISH results (high non- specific hybridisation to other 

chromosomes) were not able to be scored with respect to the fragile site. This is likely to be 
due to the large amount of repetitive DNA within these particular clones - see below. 

Tumour cell lines 

2 5 The tumour cell lines LoVo, HT29, Kato in, S W480. AGS, MDA-MB-436 and LS 180 were 

purchased from the American Type Culture Collection. LoVo and AGS cells were grown in 
Hams F12 medium with 2mM L-glutamine, 10% fetal calf serum in 5% CO2, Kato EI cells 
were grown in RPMI1640 medium with 2niM L-glutamine, 20% fetal calf serum in 5% CO2, 

HT29 cells were grown in McCoy's 5a medium with 1.5mM L-glutamine, 10% fetal calf 

3 0 serum in 5% CO2, LS180 cells were grown in Eagle's minimal essential medium with 2mM 

Lglutamine and Earle's salts and non-essential amino acids, 10% fetal calf serum in 5% CO2, 
SW480 cells were grown in Leibovitz's L15 medium with 2mM L-glutamine and 10% fetal 
calf serum, MDA-MB-436 cells were grown in Leibovitz's L15 with 16mg/ml glutathione and 
0.026units/ml insulin. 

35 
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PCR detection of homozygous deletion in tumour cell DNAs 

PCRs for the detection of individual sequence tagged sites from across the FRAI6D region 
were duplexed (34) with control PCRs from the dystrophin gene on the X chromosome (DMD 
Pm or DMD49, ref 35) or the APRT gene on chromosome 16 (33). This allowed verification 
5 that the PCR reaction was working in the absence of a FRA16D region PCR product (Figure 
4). Suitable PCR primers for Alu29, 17Sp6, Alu20. 178poly. 5. 1 A6, RD69. IM7 were used 
or for 5(MCA, forward 5*- AACACAGCTCnTATCACATCC- 3* (SEQ ID No 6). reverse 5'- 
TGGCTGTAmGTCAGAACrrG- 3' (SEQ ID No 7); while others were as given in database 
accessions, D16S518 (GenBank Z24645), Afma336yg9 (GDB 1222843), WI2755 (GenBank 

10 G03520), STSG-10102 (GenBank Z23147), D16S3029 (GDB 605884). WI-17074 

(G22903), IM9 (GenBank R05832), D16S3096 (GenBank ), D16S516 (GDB 200080). 
PCRs for GenBank AA368108 (forward 5'-TAATCCTCAGCCTCTAGAATGCCT-3' (SEQ 
ID No 8), reverse 5*- GTATGATGATTTTCAGGGAGAAAC-3') (SEQ ID No 9)and 
GenBank AA398024 (forward 5'- TGTCCTCAACTGATTCTTACAAAC-3 (SEQ ID No 10), 

15 reverse 5*-TCAATGGGTTAGGCACAGACC- 3* (SEQ ID No 1 1)) were derived from partial 
sequence analysis of BAC353B15. Control PCRs for FRA3B deletions were D3S1234 (GDB 
186387), D3S1300 (GDB 188420) and D3S1841 (GDB 254090). 

Results 

2 0 Positional cloning ofFRA 16D 

A contig of YAC clones was established in the I6q23.2 region between markers c306D2 and 
c307Al2 which were found by FISH to map proximal and distal to FRAI6D, respectively 
(Figure IB). The individual YACs from this contig were also used as hybridisation probes to 
further localise the fragile site. These experiments identified the YAC 801B6 as spanning 

2 5 FRA16D, and therefore this YAC was used as a source of DNA for subcloning the region to 
provide shorter DNA fragments for further refinement of the fragile site position. In addition, 
BAC clones were identified from the region to provide redundancy of cloned human DNA in 
an effort to avoid potential problems of instability of human DNA in YACs, as has previously 
been noted for. other fragile site regions, including FRAXA (37), FRAIOB (38 and O. Handt, 

30 pers. comnu) and a Chinese hamster aphidicolin inducible fragile site region (39). 

A pulsed-field gel restriction map of YAC 801B6 was constructed by using ////ic// restriction 
fragment subclones of the YAC for use as hybridisation probes (H13m, H22s, H23m, H29m 
and H40m) (Figure 2A). The position of the BACs (379C2, 325M3 and 353B15) with 
35 respect to the YAC restriction map was determined by both the restriction mapping of the 
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B ACs and the positioning of common markers by PGR or hybridisation (Figure 2A). The 
STS (D16S518, Afma336yg9, WI2755. STSG-10102 and D16S3029) content of the YACs 
and BACs was also determined to assist in map construction. 

5 Subclone libraries of DNA from YAC 801B6 and BAG 325M3 were generated using the 

lambda vectors 1GEM12 and IGEMll (Promega), respectively and assembled into a contig by 
end-fragment hybridisation and restriction mapping. The integrity of the YAC restriction map 
was verified by comparison with that of the BAGs, 325M3 and 353B 15. For the region 
between the BAGs the integrity was verified by the use of long range PGR using human 
10 chromosomal DNA as template, (data not shown). 

Localisation of FRA16D by fluorescence in situ hybridisation (FISH) 

There have been difficulties in determining the precise localisation of common chromosomal 

fragile sites using FISH (refs FRA3B (13, 40,41,42), FRA7G (18,19) and FRA7H (43). 

15 The FISH data have been interpreted as due to the fragile sites being spread out over long 
DNA sequences (eg lOO's of kb) or that there are multiple fragile sites at a single locus. An 
alternative explanation is that the DNA in the immediate vicinity of the fragile site is not tightly 
'packaged* into chromatin. We therefore chose to score only those chromosomes where the 
width of the gap or break at the FRA16D fragile site was greater than that of one chromatid 

20 (Figure 3). This approach was intended to reduce the possibility that the 'unpackaged fragile 
site DNA* might be looping back over the distant side of the fragile site and therefore give a 
false 'spanning* signal - particularly for probes that are very close to or within the fragile site 
region. In addition, while the use of pre-reassociation in the hybridisation process 
dramatically improved the signal to noise ratio, it did render repeat rich regions poor 

25 hybridisation probes. This was particularly evident in the FRA16D region where there is an 
abundance of DNA repeat sequences of various kinds. 

The results of the FISH experiments are plotted in figure 4. The closest clearly proximal 
probe to FRA16D is 11-44 while the closest unequivocally distal probe is 1433. These probes 
3 0 map at a distance of "200kb apart. However, this 200kb region includes consistent scatter of 
distal signal around 11-38 and 11-27 and the poor hybridisation between 1181 and 1511 (due to 
repetitive DNA content). Therefore this 2(K)kb defined by FISH analysis is likely to be the 
maximum sequence required to define FRA16D rather than provide any evidence that the 
fragile site is spread over such a distance. 
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Detection of homozygous deletion in tumour cell lines 

The FRA3B fragile site - FHTT gene intron 4 region is a frequent site of deletion in various 
types of cancer (8). Homozygous FRA3B deletions have been detected in various human 
adenocarcinoma cell lines including (gastric) AGS, Kato HI; (breast) MDA- MB-436; (colon) 
5 LoVo, HT29, SW480 and LSI 80 (8). Since these deletions are somatic events that 

presumably occur as a result of exposure of these cells to certain environmental factors (11), 
we chose to analyse tumour cell lines w^hich exhibit FRA3B deletions for the presence of 
homozygous deletion at the FRA16D locus. 

10 STSs that were either mapped to the FRAJ6D region (Figure 1) or generated from partial 
sequence analysis through the region (data not shown) were used to screen for homozygous 
deletion in various tumour cell line DNAs. The STSs were duplexed with a PGR from the 
dystrophin locus, as an internal control. The results for the analysis of one of the FRA16D 
region markers, STSG-10102 is shown in figure 4. Of the seven tumour cell lines tested, the 

15 stomach tumour cell line AGS was found to be homozygously deleted at STSG-10102 and a 
series of contiguous markers through the region, (Table 1) thus suggesting the presence of 
minimal deletions spanning the FRA16D region in each chromosome 16 present in the AGS 
cell line. 

2 0 Detection of heterozygous deletion in AGS tumour cell line DNA 

The maximal extent of heterozygous deletion in the AGS tumour cell line in the FRA16D 
region was determined by genotyping polymorphic markers. The markers D16S518 and 
D16S3029 both gave two alleles indicating proximal and distal outer limits to the deletion of 
either chromosome 16 in AGS cells (Figure 2 A). The markers Afma336yg9 and 504CA were 

2 5 uninformative and therefore did not aid in delineating the limits of heterozygous deletion. 

Open reading frames of 372 (FOR 1), 423 (FOR II), 198 (FOR HI) and 45 (FOR IV) amino 
acids were obtained for the respective mRNA sequences (Figure 7). Identical N-termini, 
unique C-termihi. 

30 WW domains were identified by ProfileScan searches (at httpy/www.expasy.ch/prosite/). 
Discussion 

The region in which the chromosomal fragile site FRA16D is located has recently been shown 
to be associated with two types of chromosomal instability in cancer. In multiple myeloma, 

3 5 translocation of Ig loci into the 16q23 region causes the dysregulation of the c-MAF proto- 
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oncogene on the affected allele. While these breakpoints are spread over at least 500kb they 
bracket both the c-AfAF gene and the FRA16D fragile site (1 and figure I). The dysregulated 
expression results in elevated c-MAF mRNA levels, which is thought to contribute to 
neoplasia. These translocations were not identified by conventional cytogenetic analysis. 
5 Their detected frequency in multiple myeloma cell lines suggests an incidence of -25%. 

Using representational difference analysis to identify differences between the genomes of 
normal and tumour cells, the FRA16D region has also been shown to be the site of 
homozygous deletion in three different types (lung, ovary and colon) of adenocarcinoma (29). 

10 The commonly deleted region includes FRA16D, with the minimal deletion in colon tumour 
cell line corresponding almost exactly to the '-2(X)kb region shown by our FISH studies to 
span the FRA16D fragile site. If common aphidicolin fragile sites confer susceptibility to 
mutagen induced DNA instability in cancer then tumour cell lines which have been shown to 
have such instability at one fragile site are likely to exhibit instability at another fragile site. By 

15 analysing tumour cell lines with known FRA3B deletions, we have found that the AGS cell 
line derived from a stomach cancer exhibits homozygous deletion spanning FRAJ6D, 
Heterozygosity of the flanking markers Di6S518 and D16S3029 indicates that the 
chromosome 16 deletions are confined to the immediate vicinity of FRA16D, 

20 Taken together these deletion data confirm the hypothesis that FRA16D is associated with 
specific chromosomal instability in cancer. 

Given that the observed deletions are homozygous they are therefore likely to represent the 
loss of a negative function (eg tumour suppressor) rather than the gain of a tumour promoting 

25 function. If the analogy with the FRA2B locus holds then a gene either spanning or, at least 
partially, within the FRA16D commonly deleted region may contribute to neoplasia as a 
consequence of quantitative and/or qualitative effects of the deletion. Alternatively, the 
proximity of the FRA16D deletions to the c-MAF gene suggests that they have the potential to 
affect C'MAF expression. The FRA3B fragile site is associated with a region of *late* 

30 replication (48) as are the 'rare* fragile sites FRAXA and FRAXE (49,50). Assuming that 
replication timing is affected by proximity to fragile site loci and, given the coupling of 
replication with transcription, the deletion of the FRA16D region may lead to an alteration in 
the liming, with respect to the cell cycle, of the expression of genes in the area - including c- 
MAF. 



35 
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ABBREVIATIONS BAG, bacterial artificial chromosome; DAPI, 4',6-diamindino-2- 
phenylindole; FISH, fluorescence in situ hybridisation; FITQ fluorescein isothiocyanate; 
LOH, loss of heterozygosity; FHTT, firagile histidine triad; FRA, fragile site locus; PGR, 
polymerase chain reaction; STS, sequenced tagged site; YAC, yeast artificial chromosome 

5 

EXAMPLE 2 - DNA SEQUENCING OF THE FRA16D FRAGILE SITE AND THE FOR 
GENE, 

MATERL^lLS and METHODS: 

10 

Cell lines 

Cell lines AGS, HCT116, HS578BST. HS578T, LS180, MDA-MB-453 and T47D are from 
the Department of Cytogenetics and Molecular Genetics, WCH collection and were originally 
obtained from the American Type Culture Collection or the European Collection of Cell 
15 Cultures. AGS and LS180 cells were grown as described in Example 1. HS578BST cells 
were grown in OPTI-MEM with L-Glutamine, O.Olmg/ml epidermal growth factor, 0.5mg/ml 
hydrocortisone, 8% fetal calf serum in 5% CO2. T47D, MDA-MB-453 and HS578T cells 
were grown in RPMI 1640 with L-glutamine, 10% fetal calf serum in 5% CO2. 

20 Large scale sequencing ofFRA16D 

Sequencing of the 270kb region spanning FRA16D consisted of 

a) Sonication libraries and 

b) Nebulization libraries of BAC clones 325M3 and 353B15 and 

c) Restriction fragments of 1 clones (for sequencing between BAC 325M3 and BAC 353B15). 

25 

a) Construction of sonication libraries: 

For DNA sonication and cloning we modified the protocol from the Sanger Centre (http:/ 
/www.sanger.ac.uk/Teams/Team53/sonication.shtml): 

30 1 mg of each B AC-DNA were sonicated in 300 ml H2O and 8 ml lOx Mung Bean Buffer 
(500mM NaAc. 300 mM NaCl, 10 mM ZnS04 pH 5.0) on ice for 20 seconds using the 

Ultrasonic Inc. Heat Systems Sonicator W-225 (50% duty, 3.5 power). After reducing the 
volume to 80 ul, blunt ends were created with adding 40 U of Mung Bean Nucleases 
(Biolabs) and incubating the mixture at 30 "C for 25 minutes. The products were size 
35 fractioned on a 1% agarose gel and fragments ranging from 0.7-2 kb were extracted with the 
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Qiaquick Gel Extraction Kit (Qiagen). 1500 ng of sonicated DNA (used in 500 ng aliquots) 
were ligated into pUC18-Sma vector (Pharmacia) at 16 °C overnight and transformed into 
Sure cells (electroporation-competent, Stratagene). 600 and 1500 clones of the sonication 
libraries of BAC 325M3 and 353B15, respectively, were gridded on 96well plates and 
5 sequenced in one direction using the Ml 3-forward primer. Sequences were assembled into 
contigs using the Staden Package (MRC) on an UNIX computer and edited in LASERGENE 
(Macintosh). For a selected number of clones additional sequences with the M13-reverse 
primer were retrieved and assembled. Additional sequencing primers were designed and 
PCR-products sequenced to close gaps between contigs. 

10 

b) Construction of nebulization libraries: 

10 mg of each BAC DNA was mixed with 200 ml lOx TM buffer (500 mM Tris-HCl, pH 
7.5, 150 mM MgCl2), 1 ml sterile glycerol and H2O added to 2 ml. The mixture was pipetted 

into an IPI-nebulizer and nebulized at lOpsi for 45 seconds. The nebulized DNA was then 
15 precipitated, end-repaired, size-fractioned and cloned as described for the sonicated DNA. 
300 and 500 nebulized clones of BAC 325M3 and 353B15, respectively, were sequenced as 
described above and included in the assemblies. Subclones for sequencing of BAC 353B15 
were picked randomly, whereas BAC 325M3 subclones were selected after hybridisation of 
specific 1-clones of the tile path, made from the BAC 325M3. 

20 

c) Subcloning of restriction fragments of Iclones between 1-32 and 1-191 was done in pUC19- 
vector. Clones were sequenced with Ml 3-forward and M13-reverse primers as well as with 
sequence-specific primers. In some cases subclones derived from specific restriction 
fragments were also subject to sonication, shotgun cloning and sequencing. 

25 

Sequencing was performed with the ABI Big Dye Terminator Kit from Perkin Elmer. In cases 
where sequencing with the Big Dye Terminator Kit failed, dRhodamine Terminator Kit was 
used, as recommended for GT-rich or homopolymeric regions by the ABI- DNA sequencing 
guide. 

30 

The final sequence was analysed using: 
BLAST (http://www.ncbi.nlm.nih.gov/BLAST), 

REPEATMASKER (http://ftp.genome.washinglon.edu/cgi-bin/RepeatMasker), and 
GENSC AN (http://CCR-08 1 .mit.edu/GENSCAN.html). 



35 
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Northern blot hybridisation 

Probes for hybridisation on multiple tissue northern blots from Clontech were: 

a) exon 7 (186 bp), positions 690 through 876 of AF227526 

b) part of exon 9A (779 bp), positions 1 182 through 1961 of AF227527 
5 c) exon 3-6A (366 bp), positions 291 through 657 of AF227528 

d) part of exon lA (163 bp), positions 298 through 461 of AF227529. 

RNA Extraction 

RNA was extracted from IxlO'^ cells for each of the cell lines using the RNeasy Mini Kit from 
10 Qiagen: The cells were disrupted by addition of 600 ul lysis buffer RLT (supplied with the 
Kit). The lysed cells were homogenised by passing 5-10 times through a 21G (0.8x38 mm) 
needle attached to a 5 ml syringe. 600 ul of 70% ethanol were added and the samples were 
applied to RNeasy Mini Spin columns. Purification and elution of the samples were carried 
out according to the Kit's manual. 35-98 ug of total RNA were obtained. 

15 

RT'PCR 

Reverse transcription was carried out in a 40 ul reaction volume using 12-33 ug of total RNA 
from cell lines AGS. HCT116, MDA.MB.453, LS180, T47D, HS578T and HS578BST, 
respectively, according to the product sheet of Gibco BRL Superscript RNAse H- Reverse 
20 Transcriptase Kit except for the addition of 20 U RNAse inhibitor (Rnasin, Promega) to the 
mixture. 

Aliquots of 100 ng of cDNA were amplified in PGR reactions using various cDNA- primer 
combinations under standard PGR conditions (10 cycles of 94 "^G for 30 sec, 60 "G for 30 sec, 
25 72 "G for 30 sec, then 25 cycles of 94 'G for 30 sec, 55 "G for 30 sec, 72 ''G for 30 sec). 

Primers (5*-3') used in RT-PGR were: 

a) JIHCMA-F (ATCTTGGGGTGGACjGAAGATGGGA) (SEQ ID No 12)and 
wb85-F (TTArrGTGGA GTTTTGTGGCGGAG) (SEQ ID No 13), FORHI specific 
30 b) FOR-ex3 (GAAGAAGAAAGTGATGAGAAGGGA) (SEQ ID No 14)and 
wb85-F. FORin specific 

c) wb85-E12 (TTAGTAGGCGAATGAGAGCGAGGA) (SEQ ID No 15)and 

wb85-A (TGAATTAGGTGCAGTGAGGAGAAG) (SEQ ID No 16), common in FORI. FOR 
n and FOR IE 

35 
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5' RACE 

Complete 5*-ends of transcripts FORI, FORI! and FORIH were determined by 5' RACE 
experiments including first strand cDNA synthesis, purification, TdT tailing of the cDNA, 
PCR of dC-tailed cDNA and nested amplification according to the instruction manual of 
GibcoBRL. 1 ug of total RNA of cell lines HS578BST (normal) and T47D (tumour) were 
taken as templates. First strand cDNA synthesis was conducted with the following specific 
GSPl primers: 

FORI (coxido-R, 5'-TTAnTCAGCACTCAGCTCAAAGTCAC-3') (SEQ ID No 17), 

FORIl (HHCMA-B, 5'-AGCAAAGAGACCTATGCCTAGCCCA-3') (SEQ ID No 18), 

FORIII (wb85-F, 5'-TTATTCrGCACTTTTCTGGCGGAG.3*) (SEQ ID No 13). 

PCRs of the dC-tailed cDNA were carried out with the GSP2-primers: 

FORI and FORH (coxido-32, 5'-ATATCTGTA AATCGATGGGACTCTG-3*) (SEQ ID No 

19), 

FORm (wb85-A, 5'-TGAATTAGCTCCAGTGACCACAAC-30 (SEQ ID No 16). 
Nested amplification was done with 5 ul of a 1 : 100 dilution of GSP2-PCR products and the 
GSP3-primers: 

FORI and FORII (coxido-21, 5*-ACATGAAGAGGCACATTCTTGGCCT-3') (SEQ ID No 
20) 

and FORIII (wb85-E, 5'-TCCTCGGTGTGATTGGCGTAGTAA-3 ) (SEQ ID No 21) in 
combination with the AUAP-primer (GibcoBRL) (SEQ ID No 21). 
PCR-products were extracted with Qiaquick-Kit from agarose-gels after electrophoresis 
and sequenced directly with GSP3-primers and the primer tj96-C: 
5'-GGAGGCAGCTCGTCCTCACTG-3* (SEQ ID No 22). 

5' RACE 

The 3' RACE System for Rapid Amplification of cDNA Ends (Gibco BRL) was used to 
determine the alternatively spliced 3*-ends of transcripts encoding FORI. 3mg of total RNA 
of the normal fibroblast cell line SF4635 and the tumour cell lines AGS and HCTl 16 were 
taken as templates for first strand synthesis. Instead of the adapter primer (AP) supplied with 
the kit, the following variant of this primer was used: 

RACE-APA^AR (S'-GGCCACGCGTCGACTAGTACGTACAGTfTIT} 5T-3'). 
This allowed a nested PCR approach in the subsequent PCR reactions. The target cDNA was 
amplified with a primer overlapping the FORI exon 8 / exon 9 boundary (5'- 
ACCAAGTCCATGGTTTCAGACTG-3') and a RACE-NESTED primer 
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(5'-CGTCGACTAGTACGTACAGT-3*). A second round of amplification was performed 
with exon 9 specific primer #9327 

(5'-ACTGCCTGGTAGAAGGAGGTCACTTCT-3') and the Abridged Universal 
Amplification Primer (AUAP. 5' GGCCACGCGTCGACTAGTAC-3') supplied with the 3'- 
5 RACE kit. 1ml of first round PGR product was used for the nested PGR reaction. Bands 
were cut out from agarose gels, purified with Gene Elute Gel Purification Kit (Sigma) and 
directly sequenced with primer #9327, 

Chromosomal DNA sequences corresponding to the alternative exons 10, lOA and lOB were 
identified by BLAST searches of sequence databases. Exon 10 was located in GenBank 
10 AC009141. exon lOA in GenBank AF179633 and exon lOB in GenBank AF009145 (see 
Figures 6 and 10). 

cDNA sequence of FOR IV (AF227529) 

The preliminary cDNA sequence of the FOR IV transcript is incomplete at its 5' end at 
15 this stage. The sequence deteraiined so far derives from overlapping EST-clones 
qf42f03xl (AI149681) and tm79cll.xl (AI570665). The latter was sequenced 
additionally with the internal primer tj96-C 
(5'-GGAGGCAGCTCGTCCTCACTG-3 ) (SEQ ID No 22). 

2 0 Determination of breakpoints in cell lines AGS and HCT116 

Deletions in cell lines AGS and HCTl 16 were determined in duplex-STS-PCR reactions as 
described in example 1. All primers are listed from 5'->3* in Table 1. 

Foiir regions of homozygous deletion (referred to asHZD I - HZD IV) were detected in the 
25 AGS cell line. The proximal breakpoint for HZD I in AGS was narrowed down to 654 base 
pairs between STSs 16D-15/16D-36 (+) and 16D-1/16D-60 (-); the distal breakpoint of HZD I 
of 3962 base pairs is between STS 16D-70 (-) and 16D-47 (+). The proximal breakpoint for 
HZD II in AGS was narrowed down to 3030 base pairs between STSs 16D-57 (+) and 16D- 
67 (-); the distal breakpoint of HZD II of 1720 base pairs is between STS 16D-68 (-) and 16D- 
30 54 (+). The proximal breakpoint for HZD III in AGS was narrowed down to 209 base pairs 
between STSs 16D-51 (+) and 16D-55 (-); the distal breakpoint of HZD III of 5690 base 
pairs is between STS 16D-202 (-) and 16D-69 (+). The proximal breakpoint for HZD IV in 
AGS was narrowed down to 5179 base pairs between STSs 16D-30/16D-44 (+) and ETAI (- 
); the distal breakpoint of HZD /V of - 1500 base pairs is between STS IM7 (-) and 410S 1 A 
35 (+). 
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Two regions of homozygous deletion (referred to SisHZD I and HZD IT) were detected in the 
HCTl 16 cell line. The proximal breakpoint for HZD I in HCTl 16 was narrowed down to 
1835 base pairs between STSs 16D-19 (+) and 16D-61 (-); the distal breakpoint of HZD I of 
5 1549 base pairs is between STS 16D-62 {-) and qzl9hl 1 (+). The proximal breakpoint for 
HZD II in HCTl 16 was narrowed down to 422 base pairs between STSs 16D-63 (+) and 
16D-30 (-); the distal breakpoint of HZD //of 1513 base pairs is between STS 16D-66 (-) and 
801A(+). 

10 For determining the presence of exon 9 of FOR I (5 1 bp) in the AGS cell line a duplex PCR 
with genomic primers from the dystrophin gene (DMD) as described in example 1 was carried 
out with primers 8040/ 8041 (Table 1). 

RESULTS 

15 

DNA sequence spanning FRA16D 

The DNA sequence spanning FRA16D was determined by a combination of approaches. 
Firstly, a tile path of lambda subclones of YAC My801B6 and BAC 325M3 was restriction 
mapped with restriction endonucleases EcoRI, Hindlll, BamHI and Sacl in order to provide a 

20 reference framework with which to anchor the DNA sequence. Secondly, either whole BAC 
DNA preparations of BAC325M3 or BAC353B15 or specific restriction fragments from the 
lambda subclone tile path were used as feedstock DNA for construction of random insert 
plasmid libraries. Sequences from the region between BAC325M3 and BAC353B15 (1 
subclone tile path 132 to 1 191) were subjected to long range PCR and restriction digest analysis 

25 in order to verify the integrity of this sequence. Sequenced subclones were also ordered by 
hybridisation with individual lambda subclones from the minimal tile path. The DNA 
sequences were therefore assembled in a directed rather than random manner. This approach 
greatly assisted in the assembly of those regions that were rich in DNA repeats. The 270kb 
contiguous sequence, with an average 4- fold sequence coverage, spanning FRA16D has been 

3 0 deposited in GenBank (accession number AF2 1 7490) (Figure 6). 

Relationship between deletion and translocation breakpoints and FRA16D 
PCR analysis of sequence tags across the FRA16D region was used to refine the location of 
deletion breakpoints in the AGS and HCTl 16 tumour cell lines (Figure 6). Both cell lines 
3 5 showed two distinct regions of homozygous deletion indicating a minimum of three deletion 
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events on the two chromosome 16s in each cell line. Four regions of the FRA16D spanning 
sequence were particularly difficult to determine because of their composition (as evident by 
DNA polymerase pausing in sequencing). Each of these sequences coincided with breakpoint 
regions in HCTl 16 or AGS tumour cell lines (Figure 6). The unstable regions consisted of: 
5 1) a polyA homopolymer region at 144 to 145kb of DNA sequence AF217490; 2) an 

imperfect CT-repeat of 320 base pairs at position 177-178kb; 3) an 8kb region at position 191- 
199kb encompassing a poly A homopolymer region followed by an AT-repeat; a polyT 
homopolymer repeat and two inverted (hairpin-forming) repeats and 4) a TG repeat followed 
by a homopolymer region (poly T) at 212-213kb. This fourth sequence is located within a 
1 0 common breakpoint region for the AGS and HCTl 1 6 cell lines at 2 1 1 .7 - 2 19.9kb of 

AF2 17490. PCR across each of the breakpoint regions in AGS and HCTl 1 6 cell lines using 
primers from positive flanking STSs failed to produce products suggesting that additional 
cryptic instability (e.g. inversions or amplifications) may also be present. 

15 The locations of three previously identified multiple myeloma breakpoints (1) was determined 
by either scanning of partial database sequences (for ANBL 6 (5',3') and JJN3) or by PCR of 
STSs on the tile path of lambda subclones spanning FRA16D (for MM.l). 

Alternatively spliced FOR gene spans fragile site FRA16D 
2 0 Scanning of the 270kb sequence spanning FRA16D by BLAST homology searches revealed a 
paucity of EST homologies. The exceptions were consecutive exons corresponding to 
sequences from the EST qg88f04.xl (Figure 6). These exons therefore locate FRA16D 
within a 260kb intron. BLAST searches with the qg88fD4.xl EST sequence revealed 
considerable overlap with clusters of ESTs the longest available sequence of which was 

2 5 HHCMA56 (U13395). ESTs qg88f04 and HHCMA56 clearly have distinct 3' end sequences 

and were therefore referred to as transcript I and transcript 11. Another cluster of ESTs 
(transcript HI) was found to share 5' but not 3' end sequences with transcripts I and IL A 
fourth cluster of ESTs (transcript IV) was found to share sequence homology, however this 
overlap is between the 5' most sequences of transcripts I - m and the 3' end of the EST 

3 0 cluster suggesting that it may represent an overlapping gene rather than another alternatively 

spliced transcript. 

5*RACE experiments using mRNA from normal (HS578BST) and tumour (T47D) cells were 
utilised to extend and confirm the sequences of the clusters of GenBank EST sequences of 
35 transcripts I - IV and to determine the organisation of the alternatively spliced mRNAs which 
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common event in tumour cells. Similarly, the loss of FORIU transcript is not common to all 
tumour cells as FORIII specific RT-PCR products were readily detected in both AGS and 
HCT116ce]ls (Figure 15). 

5 FOR encoded proteins 

The alternative spliced mRNAs transcribed from the gene each shov^ homology to the oxido- 
reductase superfamily of proteins. The open reading frames of the alternatively spliced FOR 
gene mRNAs I - III have a common N-terminus which contains a WW domain (Figure 10). 
The WW domain is truncated in FORTV open reading frame, however since this mRNA 
10 appears to originate from a distinct promoter it may well be that an upstream reading frame is 
utilised in this milNA. The open reading frame from the FOR III transcript retains the WW 
domain however it is truncated for approximately half the length of the oxido-reductase 
homology (Figure 10). 

15 

DISCUSSION 

Identification of the FOR gene spanning FRA16D 

Given the proposed role of the FHIT gene in mediating the biological consequences of 
FRA3B associated DNA instability in cancer cells we sought to identify the closest gene to 

20 FRA16D which might mediate the biological effects of FRA16D associated DNA instability in 
cancer. Sequence analysis of the FRA16D spanning DNA sequence revealed the FOR gene as 
the sole transcript in the immediate vicinity of the minimal region of homozygous deletion in 
cancer cells. Alternative exons of this gene were found to flank both the FRA16D fragile site 
and the tumour cell deleted regions - the alternative exon 9 being deleted in the AGS cell line. 

25 No additional authentic transcripts from within the FOR gene intron were evident. 

Differential expression of alternative spliced and aberrant FOR transcripts in normal and 
tumour cells 

RT-PCR and 5*-RACE gave differing patterns of FOR transcript expression in various normal 
3 0 and tumour cell lines. It will be of interest to determine whether there are differences in the 
ratio of FOR transcripts which are consistent with the biological characteristics of various cell 
types e.g. neoplastic state or metastatic potential. It is unlikely that the presence of FOR I 
transcripts will be a common property of tumour cells since at least the AGS cell line is 
homozygously deleted for the FORI exon 9. Additional aberrant FOR transcripts, including 
3 5 sequences fused to retroviral LTRs, were detected in tumour cells. 
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It may well be that the ratio of the various FOR transcripts is perturbed by DN A instability in 
the region and that it is the resultant alteration in relative abundance of the various FOR 
encoded proteins which mediates the biological consequences of DNA instability at FRA16D. 
5 For example the homozygous deletion in AGS cells deletes exon 9 of the FOR I transcript and 
may have an effect on the stability of the FOR II transcript, however this deletion is unlikely to 
have any direct effect on the FORHI transcript which terminates well outside the 
homozygously deleted region. 

1 0 Possible Junction of FOR and role in neoplasia 

The FOR encoded proteins show sequence homology to the oxido-reductase family of 
proteins and contain a WW domain. Other members of this family of proteins include the 
YES proto-oncogene associated proteins and NEDD- ubiquitin ligases. 

15 The open reading frame from the FORIII transcript retains the WW domain however it is 

truncated for approximately half the length of the oxido-reductase/ubiquitin-ligase homology 
(Figure 10). The FORIII protein is therefore likely to be able to bind proteins that recognise 
the common FORf and FORII WW domain but not able to perform the enzymatic function 
encoded by the FORI and FORII proteins (possibly ubiquitination). Such characteristics make 

2 0 the FORIII protein a likely competitor of FORI and/or FOR IL Since ubiquitination facilitates 
the process of specific protein turnover FORIII cou\d therefore act to prolong the half-life of 
its substrate by competing with FORI and/or FORIL Influencing this ratio may have 
therapeutic benefits. Thus the provision of reduced FORIII production by perhaps use of 
antisense to FORIII transcript may stabilise the balance. Alternatively over expression of 

2 5 FORI and/or FORI! could tip the balance the other way. 

WW domains are regions of protein-protein interaction that bind polyproline-rich motifs (PY 
domains) in specific partner proteins. Specificity in this interaction is determined by 
differences in particular amino acid in the various WW domains. Proteins known to bind to 

3 0 WW domains include the YES proto-oncogene product and p53 binding protein-2 (Pirozzi et 

aL (1997) /. BioL Chem 272, 14611-14616). Alteration in the relative levels of the FOR 
encoded proteins as a consequence of FRA16D associated instability is therefore likely to 
influence the biological function of the PY-motif containing-protein(s) which is (are) the 
normal binding partner that the FOR proteins share through their WW domain. 



35 
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The majority of deletions in the 16q23.2 region are heterozygous with the homozygous 
deletions being confined and limited in number. Cells which still have the capacity to produce 
FORII protein (from a normal chromosome 16 FOR allele) might have an elevated level of 
FORIIl (through FRA16D associated deletion of the other chromosome 16 allele) and 
5 therefore have a selective "heterozygote" advantage. 

The finding of aberrant FOR related transcripts spliced to retroviral RNA sequences in tumour 
cells that do not necessarily exhibit FRA16D homozygous deletion (e.g. MDA-MB-453, 
Figure 15) suggests that dysfunction of the pathway involving the FOR WW domain could be 
10 a common event in neoplasia perhaps through other forms of FRA16D related DNA instability 
such as DNA insertion or translocation. Three out of five previously mapped multiple 
myeloma translocations (21) map within the FOR gene suggesting that DNA instability at the 
FRA16D locus and aberrant expression of the FOR gene may have a variety of roles to play in 
various forms of cancer. 

15 

For the purposes of working the invention a large number of references to pertinent 
methodologies are set forth in the following US patent documents:- US 5981218 to Rio et al, 
US 5928884 to Croce et al, US 5945522 to Cohen et al, and US 5837492 to Tavtigian et al. 
These documents are incorporated herein entirely specifically for purposes of permitting 
2 0 working of the invention . 

For the purposes of this specification the word "comprising" means "including but not limited 
to", and the word "comprises" has a corresponding meaning. 

2 5 Reference in this specification to a document is not to be taken as an admission that the 
disclosure therein constitutes common general knowledge in Australia. 
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CLAIMS 

1 . A method of detecting variation of a 16q23.2 target, said method comprising the steps 
of contacting target nucleic acid with one or more oligonucleotide suitable for use as 

5 hybridisation probe or nucleic acid amplification primer specific for binding the 16q23.2 
specific target and ascertaining the binding of said oligonucleotide. 

2 . A method of detecting variation of a 16q23.2 target as in claim 1 wherein the 16q23.2 
specific target is selected from one or more of the group comprising the FOR gene, the 

10 FRA16D site, or mRNA encoding FOR protein, and the I6q23.2 target reflects chromosomal 
rearrangements or mutations. 

3 . A method of detecting variation of a 16q23.2 target as in claim 2 wherein the 16q23.2 
target is within the FOR gene and is selected from one or more of the group comprising exons 

15 lA, 1, 2, 3, 4, 5, 6, 6A, 7, 8, 9, 9A, 10, lOA, lOB or introns located between two adjacent 
exons. 

4. A method of detecting variation of a 16q23.2 target as in claim 3 wherein the 16q23.2 
target within the FOR gene is selected as either the intron between exon 8 and 9, or the intron 

20 between exons 8 and 9A. 

5 . A method of detecting variation of a 16q23.2 target as in claim 2 wherein the 16q23.2 
target is a pause site within the FRA16D. 

25 6 . A method of detecting variation of a 1 6q23.2 target as in claim 5 wherein the pause site 
is selected from the group consisting of i) a poly A homopolymer region at 144 to 145 kb of 
DNA sequence SEQ ID no 53, and ii) imperfect CT-repeat of about 320 base pairs at position 
177-178kb, iii) an approximately 8kb region at position 191-199kb encompassing a poly A 
homopolymer region followed by and AT repeat; a poly T homopolymer repeat and an two 

30 inverted repeats and iv) a TG repeat followed by a poly T homopolymer region at 212-213 kb. 

7 . A method of detecting variation of a 1 6q23.2 target as in claim 2 wherein the target is a 
breakpoint of one or more chromosomal rearrangements associated with a tumour. 
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8 . A method of detecting variation of a I6q23.2 target as in claim 2 wherein the target is 
an oligonucleotide sequence including a point mutation or small DNA rearrangement 
associated with a tumour. 

5 9. A method of detecting variation of a 16q23.2 target as in claim 2 wherein the 16q23.2 
target is within the FOR gene and is selected from one or more of the group comprising exons 
lA, 1, 2, 3, 4, 5, 6, 6A, 7, 8, 9, 9A, 10, lOA, lOB. 

10. A method of detecting variation of a 16q23.2 target as in claim 2 wherein the target is 
10 any one of the splice variants FOR I, FOR II, FOR HI or FOR IV. 

11 . A method of detecting variation of a 16q23.2 target as in claim 2 wherein the method 
consists of determining the level of expression of the FOR gene or any one or more exon 
thereof, by determining the level of mRNA expression using a probe specific for the FOR 

15 gene or exon thereof. 

12. A method of detecting variation of a 16q23.2 target as in claim 1 1 wherein the target is 
selected from the group consisting of exons 6A, 1 A, 9, 10, lOA, lOB and 9A and the method 
is used to give an indication of relative amounts of transcription of the FOR I, FOR IV, FOR 

20 II and FOR IE splice variants. 

13. A method of detecting variation of a 16q23.2 target as in claim 1 1 wherein the target is 
selected from the group consisting of the 6A, 9, 10, lOA, lOB and 9A exon and the method is 
used to give an indication of relative amounts of transcription respectively of the FOR I, FOR 

2 5 IV, FOR II and FOR HI splice variants. 

14. A method of detecting variation of a 16q23,2 target as in claim 13 wherein the method 
measures the level of mRNA expression of FORIH when compared to the level of FORE 
and/or FORI. 

30 

15. A method of detecting variation of a I6q23.2 target as in claim 2 using a plurality of 
distinctly binding oligonucleotides selected to bind to a plurality of corresponding 
chromosomally spaced apart targets to one or more change in said plurality of targets. 
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16, A method of detecting variation of a I6q23.2 target as in claim 15 wherein separate 
ones of the plurality of distinct oligonucleotides are held spatially separated on a physical 
support to provide allow for separately detecting the binding of each one of the distinctly 
binding oligonucleotides. 

5 

17. A method of detecting variation of a 16q23.2 target as in claim 2 including a 
preamplification step whereby the target nucleic acid is amplified before binding of the 
oligonucleotide. 

10 18. A method of detecting variation of a 16q23.2 target as in claim 2 consisting of a PCR 
method wherein two oligonucleotides being PCR primers are used to contact the target 
followed by an amplifications step at least one of the oligonucleotides binding the target. 

19. A method of detecting variation of a I6q23.2 target as in claim 2 wherein the physical 
15 form of the target nucleic acid is selected from the group consisting of chromosomal DNA, 

cDNA and mRNA. 

20. A method of detecting variation of a 16q23.2 target as in claim 2 wherein the target is 
chromosomal and the method comprises detecting the heterozygosity or homozygosity for one 

20 or more variants in the 16q23.2 target. 

21. A method of detecting variation of a 16q23.2 target as in claim 20 wherein the method 
includes the steps of providing a first set of one or more oligonucleotides and a second set of 
one or more oligonucleotides the fu'st set of oligonucleotide being specific for a first variant of 

25 the target nucleic acid, the second set of oligonucleotides being specific for a second variant of 
the target nucleic acid, the first and second set of oligonucleotides being labelled so as to be 
capable of being distinguished, and the method comprising the steps of comparing the 
proportion of binding of the first and second set of oligonucleotides. 

30 22. An isolated 16q23.2 nucleic acid molecule selected from the group consisting of 

a) FRA16Dsite, 

b) FOR gene, 

c) mRNA of the FOR gene, 

d) cDN A of the FOR gene. 
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e) variants of the above including, chromosomal rearrangements and mutations of 
sequences set out in a) to d) including those variants associated with cancers, 
and 

f) nucleic acid sequences capable of hybridising specifically to any sequence of a 
to e or its complement under stringent hybridisation conditions. 

23 An isolated 16q23.2 nucleic acid molecule as in claim 22 comprising an antisense 
molecule. 

24 An isolated 16q23.2 nucleic acid molecule as in claim 22 capable of acting as a specific 
primers and probe for detecting cancer associated variations of DNA sequence selected from 
the group consisting of 

g) a point mutation or small DNA rearrangement associated with a tumour. 

h) a breakpoint of one or more chromosomal rearrangements associated with a 
tumour, and 

i) a pause site within the FRA16D 

25, A recombinant 16q23.2 nucleic acid molecule including a vector and a 16q23.2 nucleic 
acid sequence operably linked to a control element, wherein the 16q23.2 nucleic acid sequence 
is selected from the group consisting of: 

a) FRA16Dsite, 

b) FOR gene, 

c) mRNA of the FOR gene, 

d) cDNA of the FOR gene, 

e) variants of the above including, chromosomal rearrangements and mutations of 
sequences set out in a) to d) including those variants associated with cancers, 
and 

0 nucleic acid sequences capable of hybridising specifically to any sequence of a 
to e or its complement under stringent hybridisation conditions. 

26. A recombinant 16q23.2 nucleic acid molecule as in claim 25 including one or more 
exons of the FOR gene, wherein the vector is an expression vector and the 16q23.2 nucleic 
acid sequence is aligned to produce or overproduce FOR proteins or variants thereof. 
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27. A recombinant 16q23.2 nucleic acid molecule as in claim 26 wherein the I6q23.2 
nucleic acid sequence encodes a splice variant of the FOR protein selected from the group 
consisting of FOR I, FORII and FORIU. 

5 28. A recombinant 16q23.2 nucleic acid molecule as in claim 27 wherein the 16q23.2 
nucleic acid sequence encodes a splice variant of the FOR protein selected from the group 
consisting of FORII and FORIII. 

29. A recombinant 16q23.2 nucleic acid molecule as in claim 25 wherein the recombinant 
10 vector produces an antisense molecule capable of blocking the expression of a splice variant of 

the FOR protein. 

30. A recombinant 16q23.2 nucleic acid molecule as in claim 25 wherein the recombinant 
vector produces an antisense molecule capable of blocking the FORIU protein. 

15 

31. A purified protein encoded by a gene which is adjacent to or overlapping a 
chromosomal fragile site including a string of amino acids unique to a FOR protein as set out 
in SEQ ID No 32, SEQ ID No 33, SEQ ID No 34 or SEQ ID No 35, said amino acid string 
being at least 10 amino acids long and exhibiting at least 70% amino acid homology to any one 

20 of SEQ ID No 32, SEQ ID No 33, SEQ ID No 34 and SEQ ID No 35. 

32. A purified protein encoded by a gene which is adjacent to or overlapping a 
chomosomal fragile site as in claim 3 1 wherein the amino acid string exhibits at least 90% 
homology to any one of SEQ ID No 32, SEQ ID No 33, SEQ ID No 34 and SEQ ID No 35. 

25 . 

33. A purified protein encoded by a gene which is adjacent to or overlapping a 
chomosomal fragile site as in either claim 31 or 32 wherein the amino acid string is at least 20 
amino acids long. 

30 34. A purified protein encoded by a gene which is adjacent to or overlapping a 

chomosomal fragile site as in either claim 31 wherein the protein has an oxidoreductase 
domain and/or one or more WW domains. 

35. A purified protein encoded by a gene which is adjacent to or overlapping a 
3 5 chromosomal fragile site as in claim 34 having at least one WW domain having an amino acid 
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String of 10 amino acids or greater with homology of greater than 70% with an amino 
sequence selected from the group comprising the region 16 to 49 or 57 to 90 of the FOR gene 
being the amino acid strings 

DELPPGWEERTTKDGWV Y YANHTEEKTQWEHPKT (SEQ ID No 4) and 
5 GDLPYGWEQETDENGQVFFVDHINKRTTYLDPRL (SEQ ID No 5). 

36. A purified protein encoded by a gene which is adjacent to or overlapping a 
chromosomal fragile site as in claim 3 4 having an oxidoreductase domain having an amino 
acid string of 10 amino acids or greater with homology of greater than 70% with an amino 

10 sequence selected from the group comprising the region 130 to 156 or 204 to 247 or 293 to 
324 of the FOR gene being the amino acid strings 

TGANSGIGFETAKSFALHGAHVILACR SEQ ID, 

LH VLVCNAATFALPWSLTKDGLETTFQ VNHLGHFYLVQLLQDVL SEQ ID, 
YNRSKLCNILFSNELHRRLSPRGVTSNAVHPG SEQ ID 

15 

37. A purified FOR protein, or mutation, or splice variation thereof encoded by any two or 
more exons selected from the group comprising lA, 1, 2, 3, 4, 5, 6, 6A, 7, 8, 9, 9A, 10, 
lOA, lOB joined. 

20 

38. A purified FOR protein as in claim 37 selected from the group consisting of FORT, 
FORII, FORIII, or FORIV. 

39. A purified FOR protein as in claim 37 being FORI. 

25 

40. A purified FOR protein as in claim 37 being FORII. 



41, A purified FOR protein as in claim 37 being FORIII. 

30 42. An agent capable of selectively binding a FOR protein or fragment or variant thereof. 

43 . An agent capable of selectively binding a FOR protein as in claim 42, having a binding 
specificity to a splice variant of a FOR protein. 
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44. An agent capable of selectively binding a FOR protein as in claim 43 said agent capable 
of specifically binding to the C terminus of one of the splice variants selected from the group 
consisting of FOR I, FOR II, FOR lH and FOR IV to distinguish between said one from 
others of the splice variants. 

5 

45 . An agent capable of selectively binding a FOR protein as in claim 43 wherein the FOR 
protein is the FORIII splice variant and said agent also inhibits at least one intermolecular 
interaction with the FORIH . 

10 46. An agent capable of selectively binding a FOR protein as in claim 42 wherein the agent 
is an antibody or fragment thereof. 

47. A method of detecting variants of the FOR protein comprising contacting a test sample 
with one or more FOR protein binding agents capable of distinguishing between variants of 

15 the FOR protein, and detecting the binding of said agent. 

48. A method of detecting variants of the FOR protein as in claim 47 the method including 
the quantitative measurement of one or more FOR protein variants in the test sample to give a 
measure of the relative amount of the one or more FOR protein variants in the test sample. 

20 

49. A method of detecting variants of the FOR protein as in claim 48 wherein the 
quantitative measurement is of FOR HI and FORII and/or FORI to give a relative quantitative 
measurement of FOR in relative to FOR I or FOR II or both. 

25 50. A recombinant host cell having stably inserted therein a DNA of any one of claims 25 
to 30. 

51. A recombinant host cell as in claim 45 capable of expressing a protein according to any 
or of claims 31 to 41. 
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Figure 4 
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Figure 5 
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FIGURE 8 

FOR raRNA sequences 

A . FOR I mRNA sequence SEQ ID No 28 

CKnoccnrrcx^AOOCXXJAcnxsAcrrnxTC 
tccacagtiagccatgck:agcgcix>cgctacxxggc^^ 

CXXiCrGGGAGGAGAGAACCAOCAAGGACG<Xrrc^^ 

cxjaacatocaaaaactcgaaaaagaaaacgactcgcaccacat^^ 

TCAGAACCCACAAGTCTrriTlXrrTCAa^ 

actotcwatgataatccgaccaackxaac^ 
cttxagggoogggatttcaciggokaag^ 

CCAACTCTrTIXXXCnP(XATXXn<K>C^ 

GTCAtXKlATmAGAAGAATGGCATAAAGCCAAGGTACAAACM 

GTXX>^GCATrnX>CTGAAOCATTX:AAGGC^ 

TCKriCTACCCrCGAGICTCACCAAAGATG^ 

c 1 1 oiUiAGCTCcixxAGCATGrnrcrr^ 

TCGATTTACAGATATTAACGACTCCmXKjAAAACrc^ 

TCKXXX3ATGCTXX3CrrATAACAGGTXX:^ 

CCXXACX5CXXXXnT:ACGT0CAACCCAGIXK:AT^^ 

GGTCTACACACTXXnT7ITTACXnTCG(XAGGCXr^^ 

GGTXjXCrTCrcATTXjTCAGTCACTrTC 

B . FOn n mRNA sequence SEQ ID No 29 

TCGGGCCCXGACGCGCGCGCGTXTCGTnWAGC^^ 

GATACGGGGCCCAGGTXX:CrcCACAGTYAGCCATGGCAGCCCT 

AGTGAGGACGAGCnGCXnXXXXXXnXX}GAGGAGAG^ 

ACCXjAGGAGAAGAClX>CrrCKX3AACATC^^ 

OGATGGGAACAAGAAACTGATGAGAAOGGAOWVGTGTrrr^^ 

TGGACOCAAGACTGGCGTTTAOX7IXX3ATGATAATCC^^ 

CCA<XA(nXXXV\TGGAAATrCnXXAOGGCOCGCAT^^ 

AGGAATAGGGrrCXiAAAC(XXX:AAGTXnTIT^^ 

OCAAGGG<XAGTGAAGCAGTGrrCACXK:AT^ 

GACCTCGCTCTXXTCCGTAGCGTGCAGCAT^^ 

TGixnxxv^cGCAGCAAcrrmK:^ 

AATXrATCIXXjGGCACrrCTACCITGTCrAGC^^ 
TGIGGTCnXXlX:ViGAGTCXX:ATGGATITAC^^ 
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ctccaacaaaaaacgactattockxx;atgct^^ 

CAACATTCATCCKIACXriXXTlXKXrKrrAC^ 
CAOOGAOCTCKXIACCACCGTGTACrcnXXTC 
ACT<XnXXX:GCTGCATXXXXrrCACCACAAGCTCV^ 
AGACiGCTXJATCCAACAACCXKrnXKX:^ 

cx}CccrGrrcrrGTcnxxxxnx:ACGCAACT^ 

CAAC5AGTAAA<XjAAATAAGACCACrrCACAACAGAGTG^^ 

CIXXKKTTAAAGTATXilACTTTIXrr^^ 

AGTAAAAACCTCCrrcKniGTCTAGGTra:^ 

ATAGAATAGCCTXSAGGTOXCTXXTrCXXATCC^ 

TXTrCCTACTTAGGGAAGAAAAAGCAAGTCT^ 

CATCXTCAOlAAGACrGAGCCAGCTTACCAACrC^^ 

AGGATGACAGTCACACCCAGAOGGAGTAGAATAaxrAGAACrACCAGC^^ 

CTTTGCTAATCCTATXXIAAAAAATTXnT^ 

CGCAGGAAGGGAAGCGTATATACrrAAGAATACACAGGATATrn^^ 
TCCCrrrCTCTCTXIVVATCACACTCI^ 
CCTACXXnTAATAAAAGAACATGCTTGAATATC (polyA) 

C . FOR ni mRNA sequence SEQ ID No 30 

TGCCCCGA(XiCCCG<XGGTCrcGrri^ 

AGGGGGGCCACGTXX:CIXXACAGTYAGCCATGGCAGC^^ 

GAGGA(XAGCnXXnXXGGGCTGCK;AGGAGAGAACC>CCAAGGA 

GAGGAGAAGAClX>\GTGGGAACATCCAAAAACTXXiAAAAACAA^ 

TXXK^AACAAGAAACTCATGAGAACGGACAAarcrirrriU 

ACCCAAGAaX}GCGTrrACTGTGGATGATAATCCGACC^ 

OD^CIXXXIATGGAAATICTCCACGGC^^ 

AATAGGGTlXXAAACCGCX^AGTCrrnXX^^ 

ACGGCCACTCAAGCAGTGTX>»LCOCATmACAAGAATC^ 

AG AATAAAAATTTTXX:ACTAGCAAAAG AAGGAAAAAATAAAAGATCTTGAATAG^ (polyA) 
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D . FOR IV mRNA sequence SEQ ID No 31 

TCGGGCCXXX3A00CGCGCGGGICrc^^ 

CATAGOXXXKXIAGCnXXrCnXACACrrCAGCC^ 

ACnX}ACK3ACGACKnXXXnXXCK»CTX3GGAGGAGAG^ 

GGGGGCCCCAGrrOCGGCCGCGGACXXrACCTXXCACr 

A<XKXCACTCCAGCGCAOCGCGTnGCGGTCCAAA^ 

AGGCrnxXCACTACGGOCCGGCCCCCITGGTX^ 

GCGCCCTXnX^CrCTTCAGGATCCAGCACTGCGCGGCX^ 

ATAAAGATGTTTTAAAAAGCCC (polyA) 

X = T / C polymorphism 
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Figure 9 

FOR proteins sequences (unique C-termini underlined) 

A. FOR I mRNA open reading frame SEQ ID No 32 
MAAUiYACLDim)SEDEliTCWEERTIXIX}WVYYANHTBEKTQW 
FFVDHlNKRT^YLDPIUJ\FrVDD^«TKFTmQRY^^ 
HVII>CRNMARASEAVSRII£EWHKAXVETMTLD1AU^ 
GLETTFQVNHLGHFn-VQLLQDVlXn^SAPARVIVVSSESHRFrorN^ 
NILFS^Etl^RRLSPRGVTSNAVHPGNMMYS^^HRSWWVYTLU^^ARPF^ 

B. FOR II mRNA open reading frame SEQ ID No 33 
MAAUlYACU)im)SEDELPPG\VEERTTKIX;WVYYANHTEEKT^ 
FFVDHINKRTITU^PRLAFIVDDNPTKPTTRQRYTC 
HVUJVaysfMARASEAVSRILEEWHKAKVCT^mJDLALLRSVQHFAEAF^ 
CUETTFQVNHLX3HFYLVQU.QDVLCRSAPARV|WSSESHRRT)INDSLGK^ 

CNtLFSNELHRRiiiPRfiVTSNAVHPnNMMYSNIHRSWWVYTf .1 FH ARPFncSMOfyi A A JTVYPAA Vp FI FHI n 
GMYFNNCCRCMPSPEAOSEETARTLWALSERLIOERLGSQSG 

C. FOR III mRNA open reading frame SEQ ID No 34 
MAALRYAGLDOroSEDEliTCWEERTTKIXSWVYYANI^ 
PFVDHINKRTTYLDPRLAFrVDDNFrKPTTOQRYDCSTTAMEIUJ^ 
HVILACRNMAR 

ASEAVSRILEE WKTKYHPPPFKCRIKIFH 

D. FOR IV mRNA open reading frame SEQ ID No 35 
MAAU?YAGU)DTDSEDELPPGWEERTTKDGWVYYAK 
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FIGURE 10 



A. Comparison of FOR open reading Trames and location of WW and Oxido*Reductase domains 



WW domains 
/ // 



Oxido-Raductaee domains 




exon 



position in 36 58 77 

coding 

sequence 



alternative 
splice 



9 (10 untranslated) 
9A 



alternative 
splice 



B. FOR WW domains 

TO/? sequence (wwi) SEQ ID No 4 DELPPGWEERTTKDGWVYYANHTEEKTQWEHPKT 

FOR sequence (WW2) SEQ ID No 5 GDLPYG WEQETDENGQVFFVDHINKRTTYLDPRL 

WW domain consensus -LP-GWE—tttGt-YYh-HNTtTTtW-tPt- 



SUBSTITUTE SHEET (RULE 26) RO/AU 



wo 01/44466 



PCT/AUOO/01539 



14/75 

FIGURE 11 



DNA sequence of axons and flanking intron sequences 



FOR exon 1 position 22001 - 23000 AC009044 reverse-complement (SEQ ID No. 36) 

ccaggccctgcccctttgacgccggccgtcgcgatattgcggagactggatttcagcttcgtggtcggcg 
gagcggcccctggagggcgcagtgcgcaggcgtgagcggtcGGGCCCCGACGCGCGCGGGTCTCGTTTGG 
AGCGGGAGTGAGTTCCTGAGCGAGTGGACCCGGCAGCGGGCGAmflGGGGGCCAGGTGCCTCCACAGTCA 
GCCMSiGCAGCGCTGCGCTACGCGGGGCTGGACGACACGGACAGTGAGGACGAGCTGCCCCGGGCTGGGA 
GGAGAGAACCACC AAGGACGGCTGGGTTTACTACGCCAAfltaaggggg ccg c ag t ggggc c gcggacgc a 
cctgggaccctgcacagcccacggacgccacctgcgcggggaggacgcgcactccagcgcagcgcgtgcg 
gtgcaaagtgaaagtaactgttaaggagcttcagggaaaagggtccagggttcccagtaggggccggccc 
ccttggtgggcctcgggtccagcgggggtcacctggtggcttcccggcgcgccctctgctgttcaggatg 
cagcactgcgcggcgcggcgagggcaaagcggcctcatccccgccaaaaaataaagatgttttaaaaagc 
gcacatgctcagctccctcctgcaggctctgggttgcagggataggagttttgttgtgttttgttttgtt 
ttgtccagaccggtattgctcagtcatccaggctggagtgcaggggtgccatcatagcccactgtagcct 
ctacctactgggttcagacaatcctctcatctcagcctcttgagtggctgggactacaagcgtgcactac 
tatgcccgactaattttttaagtatttgtagagactagggtcgcaccatgttgcccaggctggtctcgaa 
ctactgggctcaagcaggccacctgcctcagcctccagaattgagattacaggcgtgagccactgcgcct 
agccaggagtattttttag 



FOR exon 1 A (3' end only - 5' limit of EST sequence shown) position 22001 - 23000 
AC009044 reverse - complement (SEQ ID No. 37) 

ccaggccctgcccctttgacgccggccgtcgcgatattgcggagactggatttcagcttcgtggtcggcg 

gagcggcccctggagggcgcagtgcgcagGCGTGAGCGGTCGGGCCCCGACGCGCGCGGGTCTCGTTTGG 

AGCGGGAGTGAGTTCCTGAGCGAGTGGACCCGGCAGCGGGCGAiAfiGGGGGCCAGGTGCCTCCACAGTCA 

GCCATGGCAGCGCTGCGCTACGCGGGGCTGGACGACACGGACAGTGAGGACGAGCTGCCTCCGGGCTGGG 

AGGAGAGAACCACCAAGGACGGCTGGGTTTACTACGCCAAGiaaGGGGGCCGCAGTGGGGCCGCGGACGC 

ACCTGGGACCCTGCACAGCCCACGGACGCCACCTGCGCGGGGAGGACGCGCACTCCAGCGCAGCGCGTGC 

GGTGCAAAGTGAAAGTAACTGTTAAGGAGCTTCAGGGAAAAGGGTCCAGGGTTCCCAGTAGGGGCCGGCC 

CCCTTGGTGGGCCTCGGGTCCAGCGGGGGTCACCTGGTGGCTTCCCGGCGCGCCCTCTGCTGTTC 

GCAGCACTGCGCGGCGCGGCGAGGGCAAAGCGGCCTCATCCCCGCCAAA AAftMJXA GATGTTTTAAAAAG 

CGCACatgctcagctccctcctgcaggctctgggttgcagggataggagttttgttgtgttttgttttgt 

tttgtccagaccggtattgctcagtcatccaggctggagtgcaggggtgccatcatagcccactgtagcc 

tctacctactgggttcagacaatcctctcatctcagcctcttgagtggctgggactacaagcgtgcacta 

ctatgcccgactaattttttaagtatttgtagagactagggtcgcaccatgttgcccaggctggtctcga 

actactgggctcaagcaggccacctgcctcagcctccagaattgagattacaggcgtgagccactgcgcc 

tagccaggagtattttttag 



FOR exon 2 position 30501 - 3 1 500 AC009044 reverse complement (SEQ ID No. 38) 

ctacaggcacgtgccactgtccccagcbaattttgtatttttggtagagacagggtttcaccatgttggc 
caggatggtctccatctcctgaccttgtgatctgctcccctagggctcccaaagtgctgggattacaggt 
gtgagccaccgctcccgacggcctctagatattttgagtgattcagcaaacctcctaaagttgaccccgt 
agctggggtcacagtcctctttctccttcttccccctacttccttcttatatctggctatctgggagaga 
aaaaatttaatacaattgattactttttagaagagttaatttttacttattactgtggattttttgtttt 
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ttaaCflflTCACACCGAGGAGAAGACTCAGTGGGAACATCCAAAAACTGGAAAAAGAAAACGAGTGGCAGG 
AGatttgtatgttgttgtctaaggatcttggatggaagcattaagtagatgaggaaatgtcactggcaga 
gaergtgacaggttattgtgtgtttagggagggctctctggtagagaaccagactcccactctgaggagct 
tatgggatttggcagaagaagatgatgagatcatagggtggagtgaggatgatttcttactggtcttaaa 
gtaccaaaatggccagatgtggtggctcatgcctgtaatcccagcacttcgggaggccgaggcggatgga 
tcacctgaggtcaggagtttgagactagcctggccaacatgttgaaagcgcatctcttctaaaaatacaa 
aaattagctgggcatggttgcatgcgcctgtaattccagctactttggaggctgaggcacgagaagtgct 
caaacccaggagctggaggttgcagtgagccaagatcacaccactgcgctccagcctcggtgacagagtg 
agatgctgtctgaaaaatagcgacaacaacaacaacaaaaaccagaaaataattgaggcctttgattcac 
agtttatgatgttttcctat 



FOR exon 3 position 32001 - 33000 AC009044 reverse complement (SEQ ID No. 
39) 

acaagaggcaaaaatgtggagcccagggtgggatcaggggccttctgcagctgtcgcagctggaacatgt 
gacgaaagccagttgatgtgacaactgctgggtgggagggacaggcttgggggcggggctgggagggctc 
cttcccttcctgacccagggatggtctttacttctccctggcacctgtagacctgtctttcttgtgtttc 
agATTTGCCATACGGATGGGAACAAGAAACTGATGAGAACGGACAAGTGTTTTTTGTTGAsttaagtgtct 
gcaaagaaaccactctcagctgttttgctttttaataggaatttttaattataaaagtaatacatagtaa 
ctgtagaaaaatacaaagtataacagtgtgtatctgtaatcttagcagaagtgactacttttaacatcgc 
tggaacttttaacatcgctagatttattcttctattttccccgcacacgcatccttaaaaaaaaaaagta 
gggaggccaggcatggtggctcatgcctgtaatcccagcactttgggaggtcgaagtgggtggatcctga 
ggtcaagagatcgagaccattctggccaacatggtgaaactccgtctctactaaaaatataaaaatgagc 
tgggtgtggtggcacgtgcctgtggtccctgctactcgggagggtaaggcgggagaatcacttgaaccca 
ggaggcggaagttgcagtgagctgagatcgcgccactgcactccagcctgggtgacagaatgagactcct 
tctcaaaaaaaaaaaaaaaaaaaaaagagggaatgcactgtgtggactgtttagtaacctgctttttcca 
tgtaatatattatgagcattttcttgtatccatatctatttttcaaaaagatgctatttagcagctctag 
agttattattttgtacagatatactataatttatcaaattgcctattggacattaatgctaaattcctgt 
tatagagaacctataattat 



FOR exon 4 position 37001 - 38000 AC009044 reverse complement (SEQ ID No. 40) 

cttctctacgtctttaggcccttgcacagggctaagagatttaattgaaatctactgaaaagagtggctg 
acagatcttatagctacatttacatgaattacataaaagcccaaaaccttctcaagaagccttttttgag 
atctaaggatacatggcaatagttattgtatgttacagcgtatgttataggcagttctgaaggaaaggat 
tcgatgactatctttttttggcacaaatgtgatccttctggaggccagaagatagattcagtgggcccca 
gttctttcaggtttaaggaataagcattttggtctatgaaaaatggggttttcctaaagtataagattgt 
cttatatttataaatgcctgtgttcattgctgtgggttcactgctttctcttttgggcagCCATATAAAT 
AAAAGAACCACCTACTTGGACCCAAGACTGGCGTTTACTGTGGATGATAATCCGACCAAGCCAACCACCC 
GGCAAAGATACGACGGCAGCACCACTGCCATCGAAATTCTCCAGGGCCGGGATTTCACTGGCAAAGTGGT 
TGTGGTCACTGGAGCTAATTCAGGAATAGgtaggctcttcacttagttatttatctttgggactgctata 
atgagatccacttagatctagctataatggaattttgtttagtggttctctgatttaaacatgactttta 
tccttttcagatatcgtttcattaacatcactacctctttttaaatcctaatgttgtcatggaagcctgt 
gtaggggctgaccttgaagtctctgaaagctgaacactcagcaaaagactgtggctattttggtattcag 
ggatgagagacaacaggctccgtctaagagtttttgacctggtctgcatggtgtatgggcatattccaat 
tacctgggttattcaaaccaaaattgattggtaatagaatacggggaacaacaaggtattatgtctttgg 
aacaaaggatactacaggtt 



FOR exon 5 position 86001 - 87000 AC009044 reverse complement (SEQ ID No. 41) 

gtttttgcagatcttggcccccaaaattcttagtcatagtgcccttctgatgccttcaaacagatatggg 
tgtgtatgcacgtgtgttattttgtccatcft.ttctaattgttctcagtaggaaatggggctaagaaacc 
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taggtaacactggtagaagcagaggtgctggcgacacttaccatagagtagagcataaagtccttatcgt 
gggcacaggatctggtcattattgatggttaaatggctggacatgccgtgaactcttctatgtcccagtc 
ttcatctttgaaatcactgcatccctaggacaagttttctcaccatcacagtgttgacagtccgggccag 
atggttctatgttgtcaggggctgccctgttcatggtaagatgttttgtaatgcctggtgtttacccact 
agatgccagtaggactctaccccacaactgtggccactgaaaatgtctccagacatttgcttctgtcccc 
tggggatcaaaatcacccctggttgagaacttggggtaatttaagtggtgctccggtaaaggccattcaa 
catgactcactgtgttgatgttatgttttctaacattgactttcctttaaaccataaQGTTCGAAACCGC 

caagtcttttgccctccatggtgcacatgtgatcttggcctcx:aggaacatggcaagggcgagtgaa^^ 

GTGTCACGCATTTTAGAAGAATGGataagtgcttgactgttgttgttttttttaattgtcaaatacacat 
gccgggctaaccatatggagatttcagtggagtgtgtaaagtttattgctcttggaatcatgtctttatt 
tttaaagtcatcttcactttgtttgtcttatgaatgaaagcatgagcaagtccattattattcacgcatt 
tgtttgtagttcattgtcagtgtcatctgctttattagtcaggggtttgagaatgtttcttcctttaact 
catttattaagaaaaataaa 



FOR exon 6 (sequence #1 - 100 of BACl-contig #778 AF21749i) (SEQ ID No. 42) 

gcttttgaaattaagatattgtccattgcatctttccatgatcgaggcgttgtcgtgcattcccctcctt 
acagcgttttaggatgcagattgagaggagataacgttagcccgaagcgcctttgcagcctcttttgcgc 
ggctgaaacacagcacaacccacccccccgccccaccccccagcctgtaggtttagcagaatcccagcct 
cacatcctccccgaacttggcagtaaaagccctgttcttccattcattccgatgatttatattctctctg 
ggcgtcttatattaaacaggggaattccgacatgttccataacacatttactgtaacttgataccatgaa 
ctacacttgctgttatttatcatttctttttattttctctcattgcagCATAAAGCCAAGGTAGAAACAA 
TGACCCTGGACCTCGCTCTGCTCCGTAGCGTGCAGCATTTTGCTGAAGCATTCAAGGCCAAGAATGTatg 
agtgttccagtggagggttatagatcataatttcttgctattgtaatatctttatcagatgaacacaatt 
gggagciatgcaaggctgttgtgttgtcttggcgtccaaacaggaggctcatttatattggccctgttaag 
gtgaaccgtattttcttgactcacagtcaccttcattatgagatgtgtcatcaatctaataacagcttcc 
cacataccaaaagagaagacactattaaagcactagtaaaagtggctaataaaagcttggcaatagtaag 
atgcatcctgattataagattttttgtagtgcatttcagaatggagtaagagtatatttaaattgcattc 
aggaacaagtaaactcaattatccaatatggcagggaggttgacaatccaagcacccaaaaaacctctag 
tttctaaagccttcgatgatttgatgtggtacatggatgtggttccaaaaaacatggactcacattcctt 
ttatttattttttttcatcc 



FOR exon 6A position 91501 - 92500 from AC009280 (reverse compliment) (SEQ ID 
No. 43) 

acatttgtttctctccaggatctcattggcggtttgctggtattatattgccatttatattggaaaggca 

ggcagtgcgaaaactgtagtctattaagagtttgtgagtgtgtgtatgtgtatacagtgaaataatatct 

agtgtaatgtgaagtgacagtcaaaattacagcttttctcttgccgacaagggtacttctctgccaagta 

tagtaatattttggtttttttgttcaaggttgccagcaattctggcttactctcttgagagccatattag 

tgtgtggaaggaaggagcttcctgtcaggggaaatactattattttgatatatatttgttggtgccaaac 

tgcaattaaataggaggtgttggaagaaggttttatttaanccagcttgaatttgttaattaattagcta 

aaacttatttttggtttgcttcaagaaacagttctatgttgttttcataatttaattttacacaactgtc 

ttttgtttgtatctt aCflflAAAACAAAATACCACCCTCCGCC AGAAAAGTGCAGAATAAAAATTTTCCAC 

TAGCAAAAGAAGGAAAA^AMaAAGATCTTGAATAGTCTCATCAAttacatcttcttttgtgggtatt^ 

ctggtcttttcatgttttgacttctatctcggtgatctgacagacatgtacgatttgcaacaacatctat 

tatatgatttaatttatacctttatcagtttgcagttatgctttacgactcttcaggtgactaaaganaa 

aagagggtgttttaaagtgtgtgtgtttgtgtgtgcgtgtacgtgtatacatgtgtttttctgtgtacat 

aagtgtgtgtgttttaaattgtttgaaaaacactagntccatctctattgtattatctgagggatgctag 

ctcgttacttgggatagtaacaagtttcttggtgacaactcctncctcccattgtcaaggaagactgagg 

atgtcaccagagttcgctgt 
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FOR exon 7 (sequence #3501 -4500 of BACl-contig #208 - AF217492) (SEQ ID No. 
44) 

gtggttgaacctgagttccaactagggtgacaactcttcgcagattgcctgccgctgtcctggttttcac 

actgaaagttccgtatcctaataagcctctcatcctcagcaacggaggacagttgcccactcaaagcctt* 

gtgacattctagggtatcctatttctacatggtgggaatctagaagacggagaaagaatttctcattccc 

gaaggagcatggattatccttggttgtagtgtttatgttccacatcacgtggattcccgaaggagcatgg 

attatccttggttgtagtggttatgtccacatcacatgggatattttatttttcaaGCCTCTTCATGTGC 

OTCTGTGCAACGCAGCAACTTTTGCTCTACCCTGGAGTCTCACCAAAGATGGCCTGGAGACCACCTTTC 

AGTGAATCATCTGGGGCACTTCTACCTTGTCCAGCTCCTCCAGGATGTTTTGTGCCGCTCAGCTCCTGCC 

CGTGTCATTGTGGTCTCerCAGAGTCCCATCGafegggtttgaattgcatatttgttcacttatccccttt 

ctcataccagctaatattcccccaaggctc teat tctgaaaataattt tea ttagtcctgcttgagacat 

gtgggtggactcagcttggctcacttaatttttccaggtcttttttgttcgcctgcgattgtgggggact 

gtttagaaggactttctagagcaaggaagattgcctttacgactatacttcaagctcctcattgattttc 

gcttacagatggaataataacttcatgaaaaactcaatggcatgaacctattattggatttgtaattcaa 

caac t tcaacatc t taccaagaagaatgtgcagt ta ttc tagcaggagaaacaa tgcaa t tagagcc tgc 

gagatgaaatcaaattgttttataatgagaaattagggaattcgaggcag 

acattagctgtgtaattgtggaaagggaagaactgtagtt 



FOR exon 8 position 4001 - 5000 AF217490 (SEQ ID No. 45) 

tttttgtatttttagtagagatggcattttgccatgttggccaggctggtcttgaactcccgacctcagg 
tgatccactcgtctaagactcccaaagtgctcggattacagatgtgagccactgcacccagcattcctta 
gatttccaataaaaataaaaagctgtgtgggaagtcagaacttggttgcttcatgtcatatttcctattt 
ttaMATTTACAGATATTAACGACTCCTTGGGAAAACTGGACTTCAGTCGCCTCTCTCCAACAAAAAACG 
ACTATTGGGCGATGCTGGCTTATAACAGGTCCAAGCTCTGCAACATCCTCTTCTCCAACGAGCTGCACCG 
TCGCCTCTCCCCACGCGGGGTCACGTCGAACGCAGTGCATCCTGGAAATATGATGTACTCCAACATTCAT 
CGCAGCTGGTGGGTGTACACACTGCTGTTTACCTTGGCGAGGCCTTTCACCAAGTCCATGataagagaac 
agcttctggcgccgcaaacaccttgggtcctagagaaacctgcacacttgtgtctccacctttttacctc 
ttgcgggcatgagtctggtctcagtaataacattgtccagcccatcataaagggctcttgaacacatttt 
catcaactttaggttaagtctgtttgggtaaatgcgtcttggagggctgggtagaagatgtgggtttcag 
tatcatgttaagtatggctgaaagtccttatggaaatggtgatttttttgtttgtttggttttgtttttt 
ttggggttttttattcagaaactttgaaaatctattttgttgaatggagcacttgaaaactgctgttttg 
tgtcagtaggtaaacaacaaacattggtgactactgaattttcagcagatgtgattcctttgtttcacag 
aaaaactggatcttttgttctaaatttttttcttctaatgggtataatcctctgttggagagtcctttga 
tagctaggagtgtgttttct 



FOR exon 9 position 264001 - 265000 of AF217490 (SEQ ID No. 46) 

tagaccccctttgataatctccactaagcagacatactcgatacatcttcactaatgagttctgacttca 
taaaaagtattaatgacttctttttgaaagtaagagtgctttgaataccagtcgttattgctttagaagt 
tcataaaagcaaaagcacagtatttcccccagtgtttgtgcgataagagaatagaatgtaggtcccagcg 
ccttagaattttaagctatgccttctcttggtttgtgaatttccajB^^TTTCAGACTGCCTGGTAGAAGGA 
GGTCACTTCTGATTGTCAGTGACTTTGsctgagttcttaccttgtaaaagatttacaattatttcattttc 
aacatagctttatcttatgacaaaggtgacagaaaggaaatctcctaagttggcctacagggtgctttag 
aaaacatctggctgggcatggtggttcacacctgtaatctccacactttgggaggctgaagtaggctgaa 
gtgggaggatggttagagcctaggagttcgagaccagtctgggcaacaacgtgagatcctgtctctacaa 
aaaataaaaaaaattatctgggtatagtggtgtgcacctgaagtcccagctaactgggagtctgaggcaa 
ggaaattgtttgagcctaggaggttgagagtgcagtgagccgtgttgctgccactgtactccagcctggg 
caacaggacaagaccgtgtctccaaaaggaaaaaaataataaagcactgtctctctctacccttgcagta 
tccctgtaggagagagttactattagctcccagtttataggtgagtgatatggtttggatgtgtccccac 
ccaaatctcaacttgaattgtatctgccagaattcccacatgttgtgggagggacccagggggaggtaat 
tgaatcatggggcccagcctttcccatgctattctcataatagtgaataagtctcatgagatctgatggg 
tgtatcaggagtttccgctt 
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FOR exon 9A position 91501 - 93000 from AC009129 reverse - complement (SEQ ID 
No. 47) 

aagtcgaggggttggatgaatttgttcttgcacgttcagaaggataccatctttttctctgtgtggaaga 

ggcgcctgccacgacgtattgatatgtgtatttattttccaagcctgtccctgtatgaggtgtcaaaagt 

tacaccagctttacaagcggagtttatgaactcgtttttccaggatagtcacattatactttttacagtc 

atgtgctttcagcccagtaccctttgctatgccaagatccagctgaaactgaaccaggtgggggaggcct 

gctaatgcccaggcagtcgaaatgacgccatctcatcactccttctcttaaaatttttttttgtctttct 

tcttggatttc caaCAACAGGGAGCTCCCACCACCGTCTACTGTGCTGCTGTCCCAGAACTGGAGGG'TCT 

GGGAGGGATGTACTTCAACAACTGCTGCCGCTGCATGCCCTCACCAGAAGCTCAGAGCGAAGAGACGGCC 

CGGACCCTGTGGGCGCTCAGCGAGAGGCTGATCCAAGAACGGCTTGGCAGCCAGTCCGGCTAAGTGGAGC 

TCAGAGCGGATGGGCACACACACCCGCCCTGTGTGTGTCCCCTCACGCAAGTGCCAGGGCTGGGCCCCTT 

CCAAATGTCCCTCCAACACAOATCCGCAAGAGTAAAGGAAATAAGAGCATTCACAACAGAGTGAAAAATC 

TTAAGTACCAATGGGAAGCAGGGAATTCCTGGGGTAAAGTATCACTTTTCTGGGGCTGGGCTAGGCATAG 

GTCTCTTTGCTTTCTGGTGGTGGCCTGTTTGAAAGTAAAAACCTGGTTGGCGTGTAGGTTC 

CTGGAGAAGCACCAGCAATTCrTCTTTCTTTTACTGTTATAGAATAGCCTGAGGTCCC^ 

GCTACCACCACGGCCACCACTGCAGCCAGGGGCTGGCCTTCTCCTACTTAGGGAAGAAAAAGCAAGTGTT 

CACTGCTCCTTGCTGCATTGATCCAGGAGATAATTGTTTCATTCATCCTGACCAAGACTGAGCCAGCTTA 

GCAACTGCTGGGGAGACAAATCTCAGAACCTTGTCCCAGCCAGTGAGGATGACAGTGACACCCAGAGGGA 

GTAGAATACGCAGAACTACCAGGTGGCAAAGTACTTGTCATAGACTCCTTTGCTAATGCTATACAAAAAA 

TTCTTTAGAGATTATAACAAATTTTTCAAATCATTCCTTAGATACCTTGAAAGGCAGGAAGGGAAGC 

TATACTTAAGAATACACAGGATATTTTGGGGGGCAGAGAATAAAACGTTAGTTAATCCCTTTGTCTGTCA 

ATCACAGTCTCAGTTCTCTTGCTTTCACATTGTACTTAAACCTCCTGCTGTGCCTCGCATCCTACGCTTA 

AX&AAAGAACATGCTTGAATATCAtcacctgaagtttgtattgtttctttaaatgtttgtttcagtttgt 

ttttgtttttcattttttagaaaagaaatc 



FOR Exon 10 (from GenBank AC009I41) (SEQ ID No 48) 

ctttltcicaagtgttgaaaiaaattaatggtgtggccatggttctcgtacttcagglccctgtgagcccclggggtcctacacaacttggggtaatgciatggt 

cacctgcatcccctcctciccagccctcacctggttttctcttctccttccaagaaaaaacacccataccattcttaacatctctggaagcigggggtcagga 

gagggagaiccataaagttcttgtcttccccgactcaagaagcnaagggtacaitcactcacicatgaaitgactgglttaucaticactggttgatiaaiic 

attcattcactggttaattaattaattcatgtcatcctttggatgtcttgcagagCTGAGTGCTGA A/trAAA ATGATAAACAAGTC 

aaaaacaaaaaggcctctgac 

ttaacaggacttccgtgatgcgggggaagaagacaataggcaagtaagcaagtaaatatacaagataaatagaaactgtgatgagggagttaaagaaat 
taaactggtccttglggaagcaacttcgggggaggaggaaagacagggaagaccatttiaaggaggtgatgttiga 



FOR Exon lOA (from GenBank AC009141) (SEQ ID No 49) 

aacagaaaaacatgccatcatctttaatttcctgcatcagctaagcatttantcatgtagtcctcacactagattgagtggttccatattaatatcctacgtcaa 
agatgagtaaattgagatgcgtaggttctaatttgcccagtgtcgtggtgtgtgatcttgaccactagcctaaactgctattttatgtccacccatcaacctcta 
cggctgcccctcatttgaacacacaaaatatagttgtgcggcttggtggagccatctaaattccgttaggccatttgccaatgctgctattaggggcgaaat 
gtcQtgcgcttgatctaaatgtacttaggagaaUctcaggacctgatgaattattattcggaattttatggcctcacsfOTTGCAGGCirCATA 
CCAACTGCAGCT 

AATGACCTATGGGCCCCGAGAAACACTGAGGACACACGGCGTrCTGCACACAGAGTGGGCT 

GTTTCTGTCrGTTCTCCCCCTGCACCCTTCTCAGATGCAATCTCAAGTCATAGGAGAACT^ 

GCAAATGTTTCTCCTGGATGGTTTCCTTTAGAGCATGTCTCCTATAACTTGAAATGGTTGT^ 

GACCAGAATGTTTTTAGAAGTTAGATTTTrTTAGGGGGGAAACAGGAACCAAAGCAAAGCC 

A47>IGy4 GATCTTGaaaaaaaaaaaaaaaagaaaaaaccaccgtggtattctaggaagaaaaaagcatmtcgaatgaaaactttttattatatt 

tgatatattctgcttcccuccctagtatgtanaatgagatgaaatcacttcttaattttcaggttaatattaaag 

ttgaagcccatccctctaccctgaggactctgccagcctctggcagtattcctttccaacttccacttgccccaaataggtagaagttagcctttatttttggtg 
tcatgtcttctttc 



FOR Exon lOB (from GenBank AC009141) (SEQ ID No 50) 

tttctccccgacatgcccctgccaacctccccccatcattccagctgggagaggccatcaaatgtagcttggagagctatagaaaccttgagctctgagc 

ccaaggagcaagttggagtcctgcctccagctgtgtagctcagataacagctggagcagagaacatgctgttctctctgctagaatatctgacccaattct 

gactagtaaagagagttaatagtttgtacctcaagtcattcttgtccctgtgtgaaggaagagaagcagtcatttccctccacctccuacacacactcc 

ccactctgtctctcttggctgttlttctcttcattgttgtlaaactcacgttcatltctcttgtafiTCAGGGAAATGAAGAGCCACTTTCA 

ACAATTCTGAAGAAGAAATATGCGA 
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TGTTTTGCTTCGGGATGGACGCTGGACCACGAGTTTCTCGAGTCTCCCAGTACCCAGTTTGA 

ATCCCAGCTCTGCCrmATCAGCTGGGTGTTGGGCAAGTTAGCTGACTTTTGTGAGT^ 

CATCdZIilMATGAGAACACTGTTaitggtctttcgggattgttttgagaaatgagatatcgagacatgcctggcacaaggccttaat 

tcttcltcatggtcaagaaatggcagattttcccccttccattcccaccctlgcacatagtaggttctcagcaagtatttgtagatgtaatcgaccagcagaga 

tcatttgtacccttaacacccacagagagtcacagatgctttcactgaaggagggtgtcccaagactcaatggcagggaataaaaatgccaagtcatgta 

agl 

attccacaaagttagaggggaggagtaagtatctcttattcgtgcatctttatggtatgaccaagggctcatgatttgtaa 
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Figure 12 

Finished sequence (270kb) GenBank accession number AF2 17490 
SEQ ID No 53 



GATGGGaSTrrArrATCAGATACACGAAGACAGAGACCAGAGGT^^ 

ACCTGGTAGCTTGTTACACATAAAAATTCTnKKXriTr^^ 

CAGTCTGAGGTTTArrTTTTCGAAAACIOTAAGTTC^ 

ATCTCCCACGGTGGTCATCXnXXAAAGATCAAC^ 

TQCTCTACCCACCACATGCAAGCCCCAGTGTGTGTTATraX^ 

CATITATAAACGAGAGCATACAGTGTTTGGTTTTCAGTTT^^ 

CATGTCCCTQCAGAGGACATGATCT CTXU C CTi ' in ' i ATG^ 

CATCCTATCATTTATGACCATTTCGAGTGATrcC^^ 

CTCXXiTATGTAACCAAAGGAATAGAAATGATrcTATTATAAAGGTACATG^ 

AGTAGAAAAGACATGGAATTCACCCAATCCGAATTITAACAAGTCl^^ 

GGTTCTGGCOXnXKIAGAACTGAGGTGGGCrCCAAAACTT^ 

TGTAATCCCACCACTTTGTGAGGCCGAGGCGGGTCGATCGCTTGA^ 

GAAACTCCATCTCTACTAAAAATATAAAAATTTGCCGGGTGTAGTGGT^^ 

GGGGCAGGAGAATCXXriTCAACCTGGGAGGCAGATGTT^ 

CAGAGCCAGAGTCTGTCTCAAAAAACGAAAAACAAAAAAAACCAAAAGACT 

GATGCTACTTGTCTTGGCCAATGAAATGAAAAGCCTACAGGCCAQGC^ 

GAGGCTGAGGCAGGCAGATCACCTGAAGriCAAGAAriCAAAACCAG^ 

AAATACAAAAATAACACCAAAAAAAAAAAAAAACCACAACAAATTAGCXyW3GTG^^ 

ATrrGGGAGGCTGAGGCAGGAGAATCTCITCAATCCAGGAGO 

TAQCCTGAACTTCAGAGAGAGACTATCTCTCAAAAAAAAAAAAAAAAAA^^ 

GAGCAAAGGAGTCrrcTGGAAGACTATAOQGAGTGTTGAGGACCTGTC^ 

GGAAGGACTTCTTAATTCCAGCTCCCTGTTGCCAT^ 

AAATTAAAAGCX^TAAAAGTriTCGCTrATAAAAATGGTATTTCCT^ 

GTTITTGTTTGTITTCGAGACGAAGTTTCGC^^ 

GTTTAAGCCATICTCCTGCCTCAGCCTCXrrcAGTAGCTG^ 

AirXTTTAGTACAGACGGGGTTTCAGCATCTTGGCC^ 

GCXriCXirrAAAGGGTTGGGATTACAGGTGlCAGCTAGT^^ 

TCATAAATmT-AAACACTCTGCAGTTCAAAGAAAACATC?^^ 

GTTATTAGCAAATCCCTCAATGTGTTGACCAAGAGTGGGTAGGAATC^^ 

TTTTATICCTTTATAAGTACAGTTGTAGATTTACAAAGTAATr^ 

CGCAATTGAGCATATAGTACATQCTGTCCTGriGTACCCACTGA 

TTGCATTAGTGTGGCACATTTGCTACAATTAATAAACCAATAT^ 

GCTGCTQCTTGTGTTGTACATTCTATAGGTTTCTTCAAT^ 

CATTCCAAlTICATACAGCGTAGTITCAaSCCTCT;^^ 

CTCATTTCCCCAGCAAAGTCCTTCTGAGAATTrAC^^ 

ATT&TCATTITGATTTGGAlTTICTTGAaGGC^ 

Tra\GCGGTTATCAj(X;TCACACCTCTAAaC^ 

AAGAGCTTACAGAGATCCTTCTGGTTCAGAGGAAAGATTC^ 

AGGCTAAATTATAGTCCATATGACTTTrCCirrAAATCCCT^ 

TGGAGAAAAAAGTACTGATroTCCAGCAGTTXnxn^ 

TGTGGCX3VTG?VAACCTTGCCTGGCCCCCAGGGATGGQGTGAGGG(^^ 

GGTCTGCCCCTGTGCAGGCAGGAATCnTCAGGAAGCAGGCTGCCT^^ 

GCATGGATCCTGACTAAGCGGAGCAGATGCTGCXTCTGTATT^^ 

ATGTGIUTITIO'ITAGTITTTGAGACAGAGTCTUSGTCTCT^ 

GCAAACCCTGCCIXXCAGGTICAAGCGATrCTCX^^ 

GCCCAGCTAATirrTATATITITACTCGAGACGTAGTTTCGC^ 

TGATATGCGCXXXrrcAGCrrcCTAAAGTXXrrtXSGATT^^ 

ATCACCATTGTICAATCTTGAAAGGAGACTTO?^^ 

TCAGATItXTTITrTTTTTTTITGA^ 

CTGCAACCTCTOX?rcCCATGTTTCAGTCA'IT^^ 

rmTGTATITITAGTAGAGATGGCATTTT^ 

gtctaagactoocaaagtgctcggattacagatgtcagccac^^ 

agctgtcixx3gaagtx:agaacttggttgcttcatg^ 

ggaaaactggacttcagtxxcctctctccaacaaaaaacgact 

caacatcctctictccaacgagctgcaccgtcgcctcr^^ 

tgatgtactccaacattcatcgcagctggtgggtgtac^ 

TaXXXSGGCATGAGTCTGGTCTCAGTAATAACATTGT^^ 
AGGTTAAGTCTGTITCGGTAAATGCGTCTIXXSAaS^^ 
AAAGTCCTTATOGAAATGGTGATTTTTTTGTTTG^ 
TCTATITKnTGAATGGAGCACTK5AAAACTGCT^^ 
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TICAGCAGATGTGArrCCTTTC 
TCTGTTGGAGAGTCCTTTGATJ 




XnrCTAATGGGTATAATCC 
LTTITGGATGAATCATGGTA 



CrGTGGTTACATTTGGAAGTGTTTACAAAGCTrcATAA^ 

ACCTTCOTCTATGAAATGGTTTTAGIAGGAAGTATTITCA^ 

TTTITCAAAACCAAAGGACTTCAAAAATGriC^^ 

TreACAGAAATCACCCTTTTCXXAAACACAT^^ 

ATAAAACAAAGCAAAACAACAACAAAAGftAGAACTATTGCAG^ 

TCnUIXXn<XXTrCTCTTACACTCAITOO^ 

TTTTTITrTTITriTGAGACXSGACnC^^ 

TcmxrrcccxxsGTTCToxxAaTCitx^^ 

CTAATTTITrcTATmTAGTAGAGACGQGGTTTC^^ 

CCCGCCICGGCCTCCCyU^AtH^ 

CAAAAGTCTITTGAAGTGCTTAGGAGG^^ 

TAAACAGACATATTCX:AAGAGTCGAATACTTAGGATC7W7^ 

GCCGCAGCSGTCGGATTCATCTCTAGGAAGTAAAT^ 

CATCACTTTGCTCXXXSATCACTGCATAAGC^^ 

TTGIGTOCTTCAGTTTACATGAAAAGAAQGAATATGGATC^^ 

ATGQGGAGAGGAGAGAGGTGGATICITCCITATTIT^^ 

CATTTCn-AATCTirGGGTCTTGAGTCATCT^ 

CAGAGAGTTCGTTTICTCCTrcTrcCATAAGGr^ 

AATAGiaSAAACTGTTATIXrTGTGGCXrTCATCCCA 

CAAGTTCTGCTAAAATATAAAGTCAGGGTAAACATTTTr^ 

AATATGCTTAATAAAGTCAGACTTGAAAGTTGGCTGGiU\3GACOT 

GTTGGATCGCnGACTCTATCATGATATCGGAGTGTTAGC^^ 

AAACCAGAGTATGGGGTTAGAAATX^GGCAGAGGGAC^^ 

CAACTCAGTCAGCACCGTACCTGTAGTGTGTC^ 

CCACCAGTIXnGCAGCAGCCTCTGCCGCACCCT^ 

TGAACAGQCXTTCAATCXrCrATAGCAAGGCAA^ 

AACTCXX;CACTGAGCriTOAGATCCClKy«?^^ 

AAGTGGTCACATCGCAAAAATCCXyonTrTCnTGAAGTTC 

GCATTKXCTTGTAGTTGAGATGGGATTTGaCATC^ 

GGACCTCACAGCTCTCCTGCTAATGGTGCTGATCXCT 

TCACCAGCTTICTGAGCCTGTCTCATTACGATGAAAATT^^ 

CATCAATAATGATGGATATAGAATTTCTXXAGTTArrATAATC^^ 

TATTGAGTTCTAAAAATGTGGCTAATAGAACTGAGGAAGTCAAT^^ 

AAAACACCACGTGTGCa^GTGGCTGCCATACTQGAGA(XGTGGT^^ 

GAGGTATGCAGTACTTCCTGCCACGATGGCTGTTACTCT^ 

AAAATGTAATCTCTGCTGACAGAACAAGAGTCrGO^ 

CXrAAAGACTCAGGGAGAGGTTTACTGGCCAGGTAGAAAGGGCGTC^^ 

TGCACGTCrrcATTTGTGAGGAAGCXrGTAGGGTGTC^ 

TTATATGTCACAAATGGACirrAGATAAGCATGAAGATGACC^^ 

CCTCTCTAGTCyrATrTATATATTTAAGCGTTTGGTAGTAC^^ 

AAGTTATTCAGCnTTITGCCATTGAAAGTAATACGTCACT^^ 

AATTCACAGGCATTCACTTTTCOTTCATAAACATTO 

GGGTTGGTCATTITCCTAGTAAGATAGACCAGACTCyvCri^^ 

GCATTGCTTCCGTCAGGATAATGCATCAGTTTCT^^ 

GATATAAGCCACGCAGTTGCATCXrCATTTCTTCC^ 

GATGCAATGltXXn'ATCACCAGATGGCTAGTGTQCrr^^ 

CACGGGTATTOrrGGAGTACACATTCTGTIXXATC^^ 

CCTGCCTTCATTTCTTCTCriGTAGCTGGCT^^ 

TGCTGTGACTTXXTOTGACrrACGGTGATTTTA^ 

CrcTTTAAGGAATATTGTTGCAGATCAGTAATTITTVCM 

ITAAATGTTGCCCTITrACAGTTACACATTTC^^ 

CATTITGGCTCXXrrGCAATATAATAATCACTAAT^^ 

AAAGCCATGTCTCTGCCTOXTCnTCTlXn^ 

GITOnrrCITCCTCCCTTCAATT^ 

TTCTAAAAAAGATGCAATATACTTCAAATAGCrroC^^ 

AATAATGAAGTCACATCATACGTGCATTACATGTIT^ 

TAAAATGTTTTCX:AOCnxy^CGCATTAAT^^ 

tagtatcratatttatatatagatattagtctatgggltitaattc^^ 

gattttcaaggcraatcttcggtttttgcagt^^ 

gatgcagattgaccattticcx:agatacx:acgatgctitat^ 

TTAGAArrAATAATICCAAGTrcTACAATCTrAGAATrATTA'^^ 

agcgtcttcaagaaaatgatataccaggcttctaaacttic^ 
ggtaatgcccctkxialccatatctagttatgcagtmt^^ 

AATGCTGCCACOiCOQCCAACCXXy^CX^ 

TCCCAGGTCCAAGCCGGTCCTTTGTGGACCAGT^ 

TCTGCAACCACATATITOIXTrTOGCAGATAACAGTA^^ 
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ITAGCKWVCTGCATTTTCAAATCATITrACAT^ 

TICCTGTGC(XGTGTAAAGCX:AGTCA1CGT^^ 

TGCTCGTOrrGriXXTVAAGTTATAAATTATAACrAG^ 

CXXXXSTAATTCTGCCCTTGAATAACCCGCTGCAGGCn^ 

GATGTTGATTrcriTanX5AAATTAGTAT^ 

CACTATAGATACTATCAAGGGCAAAGCCXXXnXSTrrcAATCTC^ 

TGTATAAAACAGGCOTTCTGTTGCGAGGGGGAAATATGGTTTTC^^ 

ACATTTTGAGACTCCTGAGTGAGGTTTGAATTm^ 

ltnx:AGGACCXXnCTOrcGTTCGACCACACT^^ 

GTCXriClTACACXZACTGAGGCXZACCI^^ 

TTAGTAGTCTTACGTCATTATGTCCAAAGCTTTTAAGrmAC^^ 

OrTGGAAAAGGGCAAAGGATTTAGAATCATGAGGGTTGTArrCW^ 

ItW^TOVTTTGACATTTCTGAGGTTAATTTO 

AAQGTGGCTGAAATAATAGAGGTTTIXXriTCCCATGTATCC^^ 

ATGAGTAGCTCCTTTGTCATAATClTIftCTriTXXC^^ 

ATTIGATAAGGTCTGAAGGCACAGATCGGAGGGATGAGGC^^ 

ATTCrianTCAGAACTGTCCTCCTGAGT^ 

AGTTATAAAGGAGACTAGAAAAGTGCATAAOTGGCTTrTTOAGCC^ 

AAGGATAGGGTAATGGTOITGAa^CAGaCACTGTTGAAT^^ 

GAAATGOTATCGATTTTATAGGATOmCTSAGAAGAACST^^ 

ATCCACTIGACCCACCATACCTCAAGCC^^ 

GCCTAGAATACTCCTCCIXXACTCTTAATTTCTACCXZATC 

TITTQGTCrCTCMATrATATGAAGCTCTCCAA 

GTAGCACTCCTTATGTGTTCAATCAGTAAATGAAl^ 

TGAAAACATAATITCTGTA Gn ' iViU ' iU ' l ' nTlC ' ri ' I ' i ' inTl^l^ 

GTGGCACAATCCCGGCTCACTGCAACXnXXXXX^^ 

GGATTACAAQCACIXXXrCACCTVAACCCAGCTAA 

GGTCTCAAACTCXrTGACCTCAGGTGATCX^^^ 

GCX:TGGCCTGTAG ri ' lX. " i ' inn ' i C iTiU ' lUCriU ' lCTA ' iUCl ' i ' l ' inT 

AATGCAGTQGTGCAATCTIX3GCTACTGCAGCCTCCa^^ 

TCAAATACAGGTGTGCACCACCACGCACGQCTGATG^^ 

TUSTCTCAAACCTXKXrrcAGCCrCCCAAA^^ 

TAATGICIT?rCCCCCX:TTCTCTT^^ 

TGTAAATGlTKTrAAATGAATACATGAAAGATGACCAGGTGCaX^ 

ATCCCTCntSAGGTAGGCAAGGGATTATAGTTTAGAATGTGTAAATACCT 

CACACAQCAACXX:CCCAAGGTCCCCCAGGCTCAT<^^ 

GGAGGGAATGAATGTAAGGAGAGAGAGGTGAGQCACAGGAGAGAC^^ 

AGCXXnT^GCTCICICCCATTCTAACTCriTC^^ 

AACTACCTCATGGGCTTGTCnTGAGJ^TTAAAGACGTrATGAAC^ 

GTTAACTACTA'PrcTTATKXXriTCXMTAGAAATT^ 

TAACCAlTATTXXTGTCy^TTATTAACATmTTTT^ 

ATGAGTGTGrnn<n<?rGTGTGTCnxnX3TGIt^^ 

CCAAGCTGGGAGGATGACTTGAGCCTCGGAGTTGGA 

GACATAGTGAGftCXXTATCTCTAAAGGAAAAAAAAAATTATATGTATTCOT<^^ 

ATXKIATTCnGAAATGACTACCACAATCAAGCTAATTAAC^ 

AAGAACATTGAAAGTCTAGTCTCAGCAATITICAAGTATAT^ 

ATGCTGTACAATAGATCTCCAGAATGTATTCATTCr^^ 

ATCACCCCCTCaZACTTCCCCACTACCAGCCCCTC^^ 

TAAATTCCACACAAAAAGTGAGATCATAGAfyrATITCrCTT^^ 

CTTCOCATTC?ITIATAGATAC(XATO 

rrcAACATAAAAACCTCATATCSGGCCAGGCACAGTCS^^ 

AATCACCTGAGGTIGGGAGTrCGAGACCAGCCTGGTC^^ 

TGGCTGGCTQGTCATGGTGGCTCATGarrcaV^T^^ 

ATTGAGACCATCCTAGCTAACATGGTGAAACX:CTGTCT^ 

CGCGGTCX5TC3GGTGCXTOrAGTCa^^^ 

AGTGAGCCAAGATCGCACCACTGCACTCXAGCC^^ 

AGGCTGTGTGTGCnGACTCGTGTCn'ATTCmXXyVOT 

GGAGGTTGOUSTGAGCTGACATCGCATCACTGCAI^^ 

AAAAACCCAAAACTATATGTGCAAGCCTCAGGAACCACTA 

CACAGTGCAGAirrTAAGCCCTCCAGTTO^^ 

ACTrrcTTGTTAAAAACAAAGAGAGGGAAACTAGATC^^ 

COTCCTTTATAGl^nriTAAATCCGCC^^ 

GCCTTAGTATCrrGTAACAAATrATTTAATCAACyVT^ 

TITICITCTGTGTGCCCOTTGCXATTT^ 

TITTCTTTTAAGTTTOTGGTGCXnTXTI^^ 

TGATTTTTCGCTGCAAaXAGTTTAGATAOT^ 

CX3ATGTTTCroGAA aViTlCri CIX3TGAACAGTGAllU 

TAAGGTAATGTTTTCCTACX^yVAAACAAAGAGGGCTGftAACC^^ 

TCTTCTTTrrACACCAAGATAAAGAAATACTrTQC^^ 

TCATGGGTTTOnTClTrATGTCnT'ACTCATCTA^ 
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GCAAATCACTTATCCTCKrrGGTCCTCAOTO^ 

TTAATAGAACAGTTTAACTTGAAAGGAAATAAAAAAAGGAGACT 

QGATGAAAACmAACCrACaVTCATTrATTAAGAGAAT^ 

QGATGAGGAGCAATTCTTrTTTriTm^^ 

TCGGCTCACTGCAAACCrCTGCCTC^ 

CGTGCCACCAOWrCCGGCTAATITmGTATKrn'AGTA^ 

CriXSACCTCATCATCCACCCGCCTaSQC^ 

AGCAGTTCTTTCAATGGAATATATTTGAC^ 

AAGTGATGGGGTOSTACTTGGAATGACTQGCXrrAGAAAA 

GTTTCGGATCCCAGGTTCCAAACCTGCACTAA 

ATCTGTGAAATQCAGACCTTTCrrcGAOT 

TAAGeKn<XTAGAGTCTGGATTCTTT^ 

TCTATirCCACAAACSGATTTGGGGCATTAGQGACKn'AAAGGGGGAATAT^^ 

TIXyVTGGGTGCSGCCACAAAAATCATGTCAC^^ 

CATCTAACATAAGCCGGGTGCTTTGCTACXXSACAGAAT^^ 

AGACAGATGATAGGCATGGAAATGAAGAGACAAACCACTC^ 

TirCACTAATTCATCTTCACAGAAAGAAATAATATGA 

CAGGCTGAGGTGGGCTGGATCACTTGaGGCCGGGA 

AAATACAAAAATATTATCTGGGTGran«5AGCTTK^^ 

TrcGC3GCGC5GGGGCAGGGCAGGGGAGGGGGGAAAGAAGAGAAAC^^ 

TGTGCTTTGGAAAATCXXXTOTGGAATTTGGCTGA^ 

TTATTTTGCTTCTm'AGATCTGGAAAGACTAAQCGCAAAAT^ 

GTCTCSATAAAAAGGTATCAGTAGGGATCAITCATAAAACTAGTTAGTAA^ 

TACATACATCATACrrGAGACCATGTTCCAGAGAGTATTI^^ 

ATGGAAAGAGCATGCGCTTTTGACTGAGGAGGriCTTG© 

GGTCAACATCCTGAAGTCTCAGGTCCTTGTCTGCCT^^ 

GGACTAAAATCAGGAAGGCCTGGAAAGACmXSAAGCAT^^ 

TAGATTATICTGATGATTITACTACTCXrrATTATI^ 

ATTACTTATGATATCrrTTTTCTTACTATTTGg 

TTITirrcAGGTGGAATCTCCTGTCTCCCAGGC^^ 

AGlTCAAOrCATTCTCCAGTCTCAGCXrTOr^^ 

ATTTTTAGTAGAGACAGAGTITTGCCATGTTCA(^ 

CCTCXrCAAAGTGCTGGGATGACAGGTGTGAGCCACTQCCAATO 

CTTTTCriTGTTATACTGGAAGgrXZATGATGGCAQ^ 

ttcagaagtaaggtttttagaaaggtttgagttatt^ 
aatctggctcagttcaagttagctgtgtagcm^^ 
tagatctgtaactctgtacatttactcctcoxxxiagt;^^ 
catltgctataaattagtaaatcrctgtaaagctgcto 

ACyWWWVTKnCTTCCTITrACAGACAAGGACAC^^ 

AAGTGACAGGGATGGAATTGAATCTGTAirrATCAGAGGTG^^ 

ATCCGCATACATGCATGCCTCCCCTTAGACTCCAITGTTCCAA^ 

AAATGCAAAAGTGGACATTTACTCGAGAAAGAATSWtfiAAAC^ 

CTGGC3GAGTCAGTGGAGGGGTACTGGAGGAAGGCAGACTTT^^ 

TATTATTAITATITATTATTATTATGTGAGTTGGAGTCTCCTTC 

CrrrCTACAACGTCCACCTCCTQGGTTTAATCGAT^^ 

CACTACATCXiAGCTGACTrTrGTATTTTTAGT^^ 

CTCAAGTGATCTGCCTXXXriCAGCXriCCXX^ 

ATTAATAGAAGlTCTGGGACAACGGCCXrrGCriC^ 

TGTGGTCCCTTCAGCTCATTrTCTGAAl^^ 

GAGCAAAAAAGQCAGAAATTAATAAAACTCTATGTATTAGTTATCT 

GGTTAATACAATCCACATTCATTATWrATAGCCT^^ 

AGAGTCTCACAGOCTGCATTCAAGGTCTXX:^^ 

GGG<nCTTATCX:TGTTGGGTGTCATGAAGCCAAa^^ 

GCXATGGAATTGAGAAGCAGGAGCATGGCrcACAAATTCAC^^ 

ATATAGAcrmxnTCiTOrrccTcyr^ 

TGAAATOGGCAGTGAACTrcCCG<3AACCAGAGTCnTGCC^^ 

GGTGATTGTCAACTGTCTTGGCGCTGGTGAGGGGGCT^^ 

TCAAGKXrrTQGGTCGGCTATCTIXXri^^ 

CCTGTTTCTITWUWSATAAGCAGTGTTATTACAGGGTAGAA^ 

TCAAGACTCAGCAGGGAAGGATCCACTTGCAAGCCCACG^ 

GAAGGCTTCCTTTCCTCCCIGGTGT^^ 

GTAAGCCAAGAAOSAAATAGAATCCATTGGTAAAATGAAAGTCACAG^^ 
ACTGCTTTTQCCATATTCTITITCTTAGA^ 

TATGAATCCTGAGAGGCAGQGGTGATCAGGQCCAGCTACCACACTCT 

TTTGGGGACCCTtXrEACATTGTGGTAGATTTACTC^^ 

TATTACGGATAAGCTrGTCTmCl'iW i 'lV rnTi Ur^^ 

TGAGAGAGGCACCAAATGCACCCATTTAAAACATATATQCATTrCTGC^^ 

TAAGAAAGCrrATGAAAAGAATAAACAAACTGGTAGATAATGTATCCTAAAGTC 

TAGAAGGGGCCCrnnTAATCGATGGCCCAAATTGTCnT^ 



SUBSTITUTE SHEET (RULE 26) RO/AU 



wo 01/44466 PCT/AUOO/01539 



24/75 

AAAACTAGCCAGAAAGCGTGAACTTTTTITI^^ 

TGTAGAATATCTAGACTATTTGTAATCCTAGGATTTGTT^^ 

AATAACCACCX;GTTACTTO3CrGCTITrAGAAAATAGGT^^ 

TAAAGCCAACCATGTTAAAAATXTrATAATAAAGTATCTAATTTAGCAAAA^^ 

rTTTTATGCTTCTTOVCATGGCACCI^^ 

AlCrCTTGAATCATTTACATTITGAAAAAAGAAA^^ 

GTGTITCTTCAGCTTGATATCTTTATTTATTO^ 

ATAAACAGTCAGAGTgrCCAAGTAATCTGAGCTAGAAAGT^ 

GATCTAAAAAAAATACTGICTATCAACTCCTGAGAC^^ 

GAAAACCTGTAAAATTTGGGAACTTAAACTITTnTCAT^ 

TTTCATCTTAGAACATAACATTTCTTGITGQCTATTC 

AATATAAAAGAGCAAOTAATCAAATTACCATOGCrCMTI^^ 

ATAAGAAGATGCrTAAAAGAAAACATTATATTTCAAAAGAATTAGGTTCl^^ 

TTITCTTCTXXTTTTICTCXrrTCC^^ 

TATATTQGATTCXriTTATCXXrTAACCCGGAAGAGTC^^ 

TATTCXrATCTTCrGATGTCACATTATCrra 

CCGATTTCATCCAAAGCCATTTITTTTAAAATrAG^ 

CTCATGTTAGTCTGTAATGTCAAGATATGAAATGTATTTGA^^ 

TACATTCTGCAGAACTAGAAGATAAAGATAATAAAACAATTCACATTTCr^ 

OCAACATGGTGGGATGGGGAGGAGCTGAGAAGAGCATATTKXXr^ 

AGAAGGGGCTGTAAGAACACAGGATTTTGAAGTrGAAGAAACCAGCC^^ 

^TTTTtnTrTTCACnSTAAAATGGGGAAAGTAGOT 

3TGTATAGTGGATACTAAAGCAATGCTAGTCTTCTIC 
GTGTGCTACCAGrrAGAATGAATriTGCTGACATCTT^ 
ATTGATTTVTCTGATCCCCAAACATATCGAGTTACAT^^ 
ATTCITOXXIAQCCGCCATCATTCCTTCCAQGAAGGAAT^^ 
ACTGTCrmriTCAGGTCATATTAATGCAATGGTr;^^ 

ttcaccttcttctctgcatgttctttct^ 

aatgaatratgttggagtacaggcatatcccacy\gaat(:^ 

ttcggacatatttcttggattgtccgcactcata;^^ 

CATXXnKSTXTITCAGATTrAGATTTTTTCAA l 

gcagtctgocagcagaccagatggcagatctttt^ 
tgttccccaggctggagtgcagtggaksatc^^ 

CAGCCTOXXSAGTAGCTGGGACTGTAGGCATGCACTACC^ 

TACCATGTTGGCCAGGCTGGTATCGAATTCCTCACCT^ 

CAGGCATGAACCACTGTGCCTAGCCTTAATAAATACGTATITATGT^^ 

TCCCCTATGGCTGCTATTCTTTTCrAGTGAGCrATT^^ 

GCrCAGGCTAAAGTACCATGTCAAGArrATGGCraVTTGC^^ 

tcctitttcagcctcccgagtagtttggattacaggc^^ 

cgggtttcatcatgttggccaggctgcagctatttatal^^ 

gtggtagctcacttctgtaatccragcactr^^ 

cctcgccaatatggtgagaccctgtcrrctactaaaa^ 

ctccttgggaggctgaggcaggaaaatcgtttgaacrcgaga^ 

ttcagcctcggogcatgagcgagactatgtctcaaaaaaaaaaaaa^^ 

aaagacaacatgagctaicxggtagaaatqcgaatggacgcttgtagacagt;^ 

TATArrerGCAGACAGGCATATGCyVGTTAGAAAATCTS^ 

ATTTTTATCCTGAAGATTTTITTrTTACTC^ 

TITTTCTVTGGAAGTGTAATACAGGGTGAATTCCATTA^^ 

TAGTTAAGCICTTCITITATTGAAAGCCTTTCT 

GTTOrTCTACATQCCAAGriTTTACAGClTr 

GTAGATTGGGCAGATCTGGATTrCACXACAarr^ 

AATATATGCCAITAArnGATGAAAAATCAGTATTTATTTATrAA^ 

AGTATTCTAGAGTTTCTCGTTTCTTGAGGAOTG^ 

TITCGATGACTTTGGAAGCCTTGATTTCTGTAGGC^ 

GCrCTAAAACAGCATCTCTCAAACTTITI^^ 

GTATAITCGTCTCTGTGTGTGTGTGTGTATTATATGCATGI^^ 

CTTTrATCTGAGCAgKnCCIXXTO i ' r 

TITCTTTTTGAGACAGAATTTCACT^^ 

TTCCCy^GGATCAAGOGATICTCCTGCCTCA^ 

TTTTITTTGTATTTITAGa\AGGTIX^^ 

CCCAAAGTGCTGGGATTGCAGGCTTGAGCCACamsC^ 

GACCCAGTTTCAATACCAACTGTnxrrTTC^^ 

GCAGTCACTGCXrCCAATACTGTITGATGTGCTCCC^ 

TAAGGCATAGCAAATQCAAAAATGAGCAAGGTGTTCTCT^^ 

GAATGTGAAAGGGAGGTGAGATGAGCAGCACTGGGATGTG^ 

TAAAGTTGCAGAACACSAGATGACXmOTrACTGAGACTTGA^ 

AATTITAGGriCTCnTOn'AGeiCTGG^^ 

TAATGAAGATAAATGATATTCTAATCTAATATTrrAAAAAAlX:^^ 

ATTTTGGGAGGCCAAGGTQGACGGATCACXZAGAGTTCAGG 
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CTACTAAAAACACAAAAATTAGCTGGGTGTGTGGTAGTGTGTGCXrOT 
MTTCCTTCAACCTGGGAGTAGGAGGCTGCAGTAAGCTO 
ACTCTCnxnCAAAAAAAAAAAAAAAAAAAAAAAAAATTCCAAA 
TCAAGACAAATTauVTATTATAGOW^AGGTTC^^ 

AAAAAAAAAAAAAGGCCACCAAGGACXrrcATGTTAAAGTTGATGATGAC^^ 

ACTAACTirrATTTrrACATTATTrTAACTAGTATTIXn^ 

CATAGGCC>3GCAGTGGCTATAGGCCAGGACCTCn^ 

ACCGAACXyiGCTTAGAGGGAATCAGTCACGTGGACAAGGCT 

CMCGGACTGAAGTGCCCAAC3CTCTAAATCCXrrcrc 

CSriTGTTTIT'ATAGCAGCAACACAGTAAGTTGGGAAGAGGGA^ 

GTTGACCACTTAGAAATGTCACGAGTCTCTCnxy^ 

TGTACAAGTAGCAATGGAAACAAACTTGTTAAAAAAAATTTAAIUTIU^ 

AAATATITGATATTTAAGATOSAGCAeiTAACAAAATACT^ 

ATTCrT<nCCCAGGGATACCTX3ATAGAGCrAATATAGAACAATAl^ 

GGTAa<n'AGTrCATTITrATAAAAAAATTrAAAAATTCT^ 

tgatgactcttaacaaaggatgtaagacagattctaagaagagaaaaggtagaa^^ 

ttngtaaaaattatcatgcccattcattatttaaagtgctgt^^ 

acatgatttaaaatatgtaatgaaatttgcaatcx:atcagatgat^^ 

tcacocitatgaagagttacttgcattgtttgaagaggg^^^ 

cactgccttaaatgatxsctactggcaggatggttaattix^ 

GAATCnrcriGTGCCGTTAAGGGAATroQCy^GriG 

CXriWirrcTrTTOTrTCTGTTGTCnTCAT^^ 

GAACAGGATTATCCAGAGGTAGCAAAAGCTCTCCTGCAAATATTC^ 

TTICArrCAGTTCAGATTrcCTACTOOTCTC 

TCX:ACCAACCAGGTATTGCTTCGTGTCriTOyi(^ 

AATAGAAAAACTGTTGATTOTTTTTTrcATTCA(XA^ 

AACAGCXXnTTTTATAAGACTCTGCATTrACAACATTTGT^^ 

TCGTGTGCGGTGGCTCATCCCTGTAATCCCAGCACT^^ 

GACCAGCITGGCCAACATGGTGAAACCTCATCTCTA^^ 

TAGTOXIAGCTACTTCKiGAGGCTGAGGCAC^ 

CTACTGCACTCCAGCCTGAGCGTGAGACTCrrcAC^ 

TGTAATCACT^GCrATCCAGAGGCTGAGGCAOCAGAAC^^ 

GCCACTQCACTCCAGCCTGGGTC3ACAAAAAAAAAAOT 

COy^TCCCCTTG'lTlCTAGAAGAGAAATCCCTCAC^ 

TGTTTTATTATATATTCTGCCCACTTCrrATTOCC^^ 

ttttgatacagcxstcttactgtgttaatcaggctag 
ggctcaagcaatcctctggcx:tcagcctcccaagtag^^ 
attttttatagagacaagcmtorctatgttcxxx:^^ 
tctcacaaagiqcim^attacaggtacaaoc^^ 

TATTTTCTTATATTTACACX:ATATGTGGTCATTTAAAATTATr^ 

TTTTAGCATTAATATTAGGGAAGATAATCTGACATn'AAAAATACAT^^ 

TATAAAmCCAGTACTAGTGTCTGCTAATAGCnaXrGCCACr^^ 

AGGTTGGCAAAGGGGACAGGAGACTGTGGGATGATTAGTATATTCTTAT^^ 

TCCAACAATAITCATIClxnxSTGTCACCTCAGCATTTAATT^^ 

GTTGGTCCTArrAATTACAACCTAGAGGGATTCTATOAGTAAACT 

GACTTTCGAAGGACTGATCCGCAAGGGCCTTCTGr^^ 

GTTTCCCATAAOCTTAGAGATTATCTAGGCCAAGGCTATTAATTTA^ 

ACTGTKOTAAGTICACACTGAAAGAGAGTGGTGTAATGCCCTT^ 

TAACGGACATQGTATCCCCAGCATlTrrAATAAATOCCCm 

AGTGGAGCGTCATTTCATCTITCATCAAGAGAAGCAa 

AAGGTCATGCGGTATCTGCCATTTTGCATATTCTTQCTrT^^^ 

TGGTATATGGCTTAATTGTAAGCGATCAGGCATCAATQGCTT^ 

CATTATTATAGATTQCTGATGTTTGCXIAATOGATA 

AATAATACCAGTGACATGTTTGCATTTACAriCA^ 

ATTTTATrTTATTTTAATTCTTTTTGAGACA^ 

GCTCACTGGAGTCTTGAOCACCCTGGCTCAGGTGAO^^ 

CCAGCACACCCAGCTAATTTTTOn'ATTATTTATAGAQr^^ 

GGGCIXy^AGCAATCCXXXX:ACCTrGGCC^ 

NGCXrnxXCCTOCCCITCCCTCCCTTCCa^^ 

CTGGGCICACTGCTiACCTGTGCCTCCXXKS^ 

ItSCACCACCACAaXAGCTAGTTTTTCrrATT^ 

CTGACCTCTCCTCGGCCrrcCAAAGTGCTG^^ 

TAATAACCTCTCTTCAGTCTACTrTTTCTC^^ 

CTCCAACCCTGAATGTCACTGATACOUCACCATA^ 

GAATGACrCAGACTTOGAGCTTGAAAGAGGGAGAAC^ 

TGCTTTTCAAGAACTAAAGAGTAGACTTGACCACI^^ 

CCACAAGATGGAGTGATCTTTACGCCATCCTCCTGAC^^ 

TTGCKrnOTACGTGAGGACACATCACTOX^ 

AATAGGCOCTCtrAGAmTCAGTAGTGACAGGGGTATCM^ 

CTACTGCACAAGTTTaVTACTreAAGAGTGGAC^ 
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atkyvgatticactatccatgaatrctggccttaactt^^ 
ogttttccaaatgaggaactcatcggtaaggccagaaagga;^ 

GAAAAAAAGAAAGAGCGTAriTATAGATGGTAGAGAATlXrroGGCAT^ 

TAGGTGGGACTGCrGATCTTtmTrATGAAAGAAATAAAGGO^^ 

TATGTCTAACCX:AGTXyw:AGGTAAGGCAGCCAGGAGCTGCT^ 

GCACAGCTCAAAAACXTTCACAGGTCACGCC^^ 

ACATGCCCTATGAATTATACCTTATTTATGATCGTCGAGCCCTC 

OWTITTTTAATACTCTATTTTVTTITACXSOC^ 

CAGTTTCCCXrAOCATTTAAGGAAATATTGriXrrAAGACT^^ 

CAATGGAAGGACCCrCAACAGACX:ATTTTCAGCATTT^^ 

GTAOOTITrACATATGCACATTATGAACIWrGACCAGG^ 

GCTTAACAAGTTCCTCTGAGTATTACTyGTIK^^ 

AGAATCrCAGCGATGATCCTAAATATCTTT^ 

TAGCAGGAGCTTTCAAAGTCTIXalGATTGATCTTlT 

GCAAGAGAACAGTGATITAATGAAGICTCTAGTGAGCX^ 

GCCAGCATAAAGGAAACTGCACTCACACTCCTATTAAGCAGGTATIT^^ 

ATCnGAGCAAAAGCAATGCCTTACCTCTTCAGT^^ 

GGGAGATCAGAGGTCXXSTCTCrAAGTCGCCCITAGCGGA^ 

TVWiTroGTGATTAGAACTGGAAAGGTQCAATAACATCriTAATACAAAC^^ 

AAACTCGTCCTCXXrGCXrKXy^TAGTTACGC^^ 

CATAGACAACATCTAAATTTTTGTTQCTIT?CCTGGGT^^ 

CTTAGGACATACTACAGTAACCrmaXACCACCGT^^ 

TGCAGGCGCAGTGACATTrATGCACTWlTCGCTT^^ 

CAGGTCTTrcTTTCATGTCAGArcCGTATCCX:^^ 

tgaaaacxxtgaaagaaaataaacaggcagactcaggaattatac^^ 

tttaggtgcactgcgcatgttggaggitxxx:ti<3^^ 

aaarrcttacccgatgcttaatgaatkxxaggct^ 

itoatatagactggagatarmxiatgcctogcaga^^ 

gtccagttgtgctgagtagctgtcattattgataa;^ 

tcaggatgcatagtagagaaaaatgagatgaggtaatgagatcaca 

aggttctcaataaatcttttttgattct^^ 

tccataaaacttgggtaattctaatgcttftcttcatc^^ 

gagtttaaagaagtgttttatgagggagttttgtttgcaaa 

ccttttctagtaiggacctttggggctcag^ 

gagtctitcrrccaagggaacagaagtcttaqgttgttgatc 

tgtagaggagatgaagctgatcagagcacacccacaccagtgac^ 

atctgagagatgtggcatgaggaacactgagacctaatgc^ 

ctagtactaagcagaatggaagggaaggatqgaaggaagagagggagagacggagggagat^^ 

ggagtgagcxsuuaagaaatagggagaagtatagagagga/yvgaggggaagggaggaa^ 

aagagagggaagtaggcacagaggtagagaggcagggagataatggagatcctggcc^^ 

tagtccacagaaagcaccttgaccaagtgactgaatj^^ 

TCTGCCATATTTCCTTGAATCTTCATGACGATCC^^ 

CTATCTCAGAAGCAAGTCAGGATTTGAAGGCAG^ 

TCrrACGTTGCCaXIACCCGriXKMCAAri^^ 

CCTTTlTGGAGAAATGGCCACTGCAACTCr^^ 

AAAGAAGAAATAAAACATATTACTCCATTCCCAT7W7rTCn<^^ 

GAGATTTACTGAACTCAGTCXSTTTCACOT^^ 

CATCTTACATOGCAGCAGACAAGAAAGCGTGTGCAGGGGAAC^^ 

CACrATCAAGGGAACAGCACAGGCAGATC^ 

GAGTTCTGGGAACTACAGTTCAAGATGAGArrTGGGTGGGGA^^ 

riTCTAAGGCAGATGAATAATCCTGGTTGGAAATCATACl^ 

CTCTCCAACTCXrrAAGCTCAAGCAGTCCGCCTG^ 

CAGCCTGGGGGAGGGTTTGTTAAAGAACTGTCriKnTra 

TGATACCTTTICATTAGTAACTCACATCGACC^ 

awXTTATAAAACCTGrriTAGTAATTTACTCCC^^ 

GCCCAGGTAAATAATAaXACCATCAGAGGGCACX:AGTAG^ 

QATTTTTTCTCAATAACTGATCTrTA^ 

ACACATTAAACACTAAGAGGICACATAGGATATTCCATTrATATAAAA 

TACGCAlCGGTGGTTIxnGGGGCGGCGGGGAGTATGGAAAGCC^^ 

TTITCTTTTITTTITCAGACGGAGTCTT^ 

TCCXX:CICOCAGGTTAATGCCATTCTCCTXC^^ 

CTAATrTTTTGTGTTITrAGTAGAGACGGGGTTTCAT^^ 

CCTGCCTTm:cnXXX::AAAGTGCK3^ 

TGGTGAAACTCTTTTQGAACTAGAGGTGATAGCTCTAC^^ 

AAATAATTTATGTTATGTGAATTTCATCTCAGCAAAAAG 

ATTAAGCTGCTGTTGTTGrmriTAAATATTCC^^ 

TGAGGTGGGTGGATCACTTCAGGTCAGGAGTTCGAGAT^^ 
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CXOTCTCTACTATAAATACAAAAATTAGCCAGGCTTGGCGC^^ 

GGAGAATCACTTCAATCCGAGAGCXyVGAGGTTGCAGT^ 

GAGACTCCCmnCAAAAAAAAAAGCACAAAAAAl'ri'lTlUClU 

CAGGAAATAATTCGGTOH'ATGTAATGCCCAGTAAAGTAAACAi^ 

GCTITATTCACAGAGCXrATGGGGACACTAAACGTACCTTAT^^ 

TTATTAATAATGTATACATTGAAAAATCTATGTGCATTATAGTCnT^ 

ATTTTCCnCriTn'AAAAAAAATTTTTCAAAGAC^ 

GCrmriTCTGATCACCACCCATCTrcATT^ 

CTGTATATACTTCTATGCGCACATACACATATTTrAAGGAAAGAAT^^ 

Cy^XnXSAGGAATATTOyU^TACAAGTC^ 

AOSADCTCITrTCATrGTrcTTTTATT^^ 

GAATQGTCTTITXn'AAAATGACITTGAAAACTC 

ACTGTAACTTGGCTTICATTAGAGTAGAAAGAGATAAAGCT^^ 

TAAGGAAAAAGGGGACTGAAATTAGTOCTGATAAAAGCrrGAATTAGACA 

ACrmSAATTKXSAAGAACATACTTCATAC^^ 

TAAAATGACTCCAAACTCAGTGATACTOATCACCC^^ 

TtrrcACATCTGTTGACAAATGGGCAATC^^ 

AAAATTCTCTCrKXriCTTTTCCAAGa^^ 

atggcxtcigggagtcx:aagaactacctttacaci^^ 

tgctcaqcttctcntitctggagacci^ 

ttttititaaacttcatggtgagcagaacacnxstac^^ 

accwscttccnctcaccttaacccc^^ 

gagttcacactgagtttgcagattcy^ 

ggoccyvttgtcggccctcttagaagagatttcaggccntcc^^ 

gaaaaagcacatgacttggaataaagacttagtacacatccxatcnc 

ACGTTACTCXyVCCTGTTTGAGCCrcAGriTI^ 

GGATCAAATGAGTTOSAGTGGGATACACXAACAATCT^ 

TATTACACCACATCITCrcTGGGTCITIC^ 

CTCTACCCCAAAAGTATATTCAAGAGCGAAACTTirrOTrAT^^ 

AGCTATCTITCITGAGTCTCTCAAAAGACA^ 

CTTACTCGTTnCAAATACTCAAATGCCATTGAGTa^ 

GAAGTCACCACATGCTTCTTATGGAAATCGT^^ 

ACCCATCAAGATATCOVrCTGTAAAATATITAAGAACTGG^ 

TAAGAGCAGAACAAGGAAAATATAGCTGACTATAAGTrACAAAGGCAGACTC^ 

GAGCACTTGGAGAAGGGCTGAAGGCAAACCAAAAGTGAGCAAC 

CXXAGTGAGQGTGATGATGGTGTTACGAATTACCXIATra 

GCAGAAGGGCTITIGGATAAAAGGGGTCIT^GTrAAGAGATC 

ACATGTCTGAGTACCGCGCAGATAGTGGGGAATAGTCTATGGAIT^^ 

GTTGTGTTTITITTCCICTCATT^ 

GTGCTAGCCAGOCACAGGGCa^AGTGACACCrrAGAGG^ 

ATCTGTAACCTGGGATTAGATAGAGCCAATAAGATGTAACTGAAATTCTC^ 

GGTACGlTAaTGAAGGTGATGCTAGCTGCTATAACAAATAAGCAC^ 

ATTTOTTGTITATGTAAGATCCATAATGACTOTGCC^ 

AGCTCTTTTCATOT<nXX5CTCAG^ 

GGAAGAAGTAGGAATITGCy^GGACCATCTGTGGAGCATTT^^ 

AGGAGGCTAQGAAACAGAGCATAAarrTGTQCCCAGGAAGATGCG^ 

TTTGTTTAAAATCACTTTGGATTCTCATTTACCIT^ 

TAACTAAGCATCXrTTOCTTCCCTCTACGATITCAT^^ 

TATATGTATGTAOtXATGTATGTATGTATGTTTGAGACAGAATC^^ 

TCTTGACTTACTGAAACCTCCTCCT^^ 

ACGCCACCACACCIGGCTAATmrGTATTT^ 

CTGACCTCAGGTCa^TTCACCTACCTCGGC^^ 

ACCCXACTGCTATTCICCyiGTGTGGTAGGT;^^ 

ACTTCACAGCCTTtX™x:AA^^ 

TTTCCTTGAATACCTTTTGCTTTCT 

GATTTCATGCTTCCAGCCTTAGCGGrETTC^^ 

AGGTCCTCACGTTGCAGAACACTGACCAATGTTQQCAACTAC^^ 

TTCCCTGrarrOCCAATTCTTCAGCTGCTAATAAACAT^^ 

TAGCCCTCGGAGATGAGGATGCTITOCGAATGTAACAGGCATC^^ 

TAATTAAGTTTTCCTTCAAATGGGGAACT^nCCTC 

CTCX:ATanTia3ATCTTrTTCACCA^ 

CCATGAAATCTCTGOiGCGATAGTTGAATGTCTTGC^^ 

CACATTTATATTAGGGTAATGGCCTCATGCTATTTGAAAATCT^ 

AAGCAATAGTOXTCAACATCATTTGGCCTGAACTAGQCTAG 

TGCCAAGGAGTGGTGTCATATCATGAAGTAGATTTATGTACAC^ 

AATTTTTTTTTCTATOCACCAAAATATTCC^ 

TGCAAAATCCTTCTTTAGAAATGTTACCTAATTCCAATff 

TTAGGTGTTATGATQCCAATTTGCCCACACATl^^ 

TCAAGGTTAAAACAGGAAAAAGTAACACAGTAAATACCIATTTT^^ 

AGCACCTATACAAGGAACTQGCTGGCATATTGCATTCCTG 
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GTGGGAATGTGGTCMCCTCTTATCTCATAAGCCA^ 

TITH'AAATAATGAAAAACTrKKXXXXnTGTT^ 

TCTTAGTAGAAACrrAATTTCTCOTGCTTTTCra 

GAAAAAAATGATGAGCCQGGCACTWCntSGCTCACG^^ 

AGGTCiUSGAGTTrGAGACCAGCCrGACCAACATC 

TGGTACACAGCTGTAGTCXXlACXrrACriTCSGGAGGC^^ 

GAATTATGTCATrnCTTGTCACAGCGTCAGCAGAATAA^ 

CCAGTCrrcTGCTTTAGCACGGGAAAAAGCAAACTAGGGCCrT^^ 

TGCAGGAGAACATTITTAAATTGCntSAAAAAAT^ 

AACTTGAGTGTCCTTAAGTAAACOTTrATAGGAAC^^ 

GAGTCTCACTCTGTO^CCCAGCXrrAGAGTGCAGT^ 

ATTCTCCTGCCICAGCCTOOCAAGT^^ 

GAGACAGGArrrCACCATGTTGGCCAGGCTAGTCT^^ 

TGCTAGGATTACAGACATGACX:CX3ACCTmTATGTATT^^ 

AGGTTAGAGCACAGCATXSAAGAGCTGaCAATATTGACCC^ 

AGATAO^CTTATCnXnXSAGCAGACGTGATTCAGGGGAGA 

ATCTTCXXyVCCAGTCCTGATATGGAGCGTin'lTinUX 

CXrrACTCITTACTCCX:AGGTCIT^^ 

GAATGCCAACAAGAGTCACCICTAAGATGATCCAGCTC 

CCATICTrcTTCGCCGTTCTTACX;GTAlTAATTTAC^^ 

QGAACTICTGTrCGAAATCAATACTCTTGAAATAT^^ 

TGATAATAAGAATAAAAGAAAATGJUUGTGACATTGAGGACGT^^ 

OGCTGTAAAATGJ^CTGTTACAOSTTTATGGATTGA 

ATTACTAACACCTCTCACATTGTTAGAAAACAGTTTTATGITI^^ 

AAAGTrCTCATTICATGCXrTTAGGAATCCCCTGC^ 

tkxacatcttaccccgatgcactctct^^ 

tcatttgtcagattggttoaggtagaaaataaggatr^ 

ttttctttaaagctgckotactcaaacgac^ 

titacatgctatacaattcatxxatttaaa(nk3t^ 

agtcagttttcttacctgaaggagaaagcctgcacccc^ 

ctctgcgaaccgctagtckxnctocatctc^ 

txnqgtictttgtgagtqgtcctltcacttt^ 

ttitrcttacaaatacatrtttci\aaatgagtaaac^ 

gagattcatctttgtagcx:accaagagccactaacagt^^ 

AAATCTGTGACrcATAAAACCACltKriCATTACT^ 
AAAGGCTGGCnxrATTAATAGGATATGTCTOGAGTTCTGC^^ 
TCGAAGCTGCTGCTGCTTCTTCATCTTTATCTTC^^ 
CTCriCCTTGTrcyWXCAGATTt^^ 

GGATAAACTGGGATTTGTAAATATTTOXnTAGTTTT^^ 

TAAGOrcGTCeiTrATAAAGTTTAGACCTATTTTAATAAGGC^^ 

CITGATGGAiCAATGAGTAClCACICTftCTIT^ 

T?yXCACXX;TC3GAGTGCAGTGACATCATC^^ 

GCCTCCCCAGTAGCTGGGATTACAGGCATGTGCCACCACACCm^ 

CATGTTAGCCAGGCTGGTCrTGAACTCCTC^ 

TTAAAACTCTACCnTTTAAAAATTAGAACCTGGACAAAGAGGTACT 

TCWSAAGAGTTTCXSATTTGGACXrrACCTAGO^ 

CTGTT^AAATGATCACTCCCCAGTITGACAGGAATTGCCC^^ 

CAAATCTOyXTGTGATACAGGTGGTGAAGCTACTG^ 

TCGlTITCAirrGCATTTCTCTGAraX^ 

TTTGAGAAGTGTCTCATCATATCXriTCGC^ 

GGACAAAAAACCAAACACXXXy^TSrrcriCACTCATAayiX^ 

GTTAATOGGTGCAGCACACCAACACAGCACATGT^ 

TGAAAATATAATAAAAAATACATATTAAAAAAAAAAGAACTCGAGCAT^^ 

CATGATCACAGAGGATGAAAAU' i ViljCTI A AATTATAAGCTAGCT^ 

AAAGTGAAATGCATATCTCTAAGGGCTAAGTAGCCAACACAATAGGCAAT^^ 

TGTTTTGTITCGTITTrCTTTT^^ 

TTCTCXriTrOCGTCClTCCTCCrrc 

TrGTCTClTXX3CTTTTGTIX3GTAATTAAATA^ 

GGCACOCAAAAGCATAAGAQCCCTTGAAAGCAATATCTAAGCATAGATO 

CTTTTAGTX7I7UVTCGAAGCAGCAAGAGTAGTCACAGCA 

AGGGGATQATCrrrCTAGTCTAGATTACCTATT^ 

AACCTAGCAAACCTTTGCATAAATCCTTTAATTGAATr^^ 

TTCTTmrrAAGTGrrACTGTGTAATI^^ 

AACCTCACXriCTGGGCTTCCTTAAAACTAAC^ 

AGGATTATGGAAAAAATAATAAAACT?ATTTCCaGGGGAGAArrT^ 

TTAATAGTXSAAATCATATCATATATATAAATCATATTTTAGCCTAT^^ 

CTTGATGTAAAACXIATCm'ACGTGCGGATAATCTrTr^^ 

CAAGTCAAACACATTAAATGGTGGGTTlXiATCCAAAAAATC^^ 
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ATTAGTGGGAGGAAGTATAAAAGATATGGAAAAAGATATTCTGGTTAIGTTC^^ 

AGGQGAAA<WVAAATATAAAAGCTQCAATAGGTTTTTCTATTT^ 

ATOCCCTAAAGAAATATTCGTAATCAATAAGAACACAAAATATG^ 

TAGCACAGAATGAAAAACACAGCAAGrrTATCTACX^ 

GATAAATACAAGTAGTAGCATTTTTAGTAAAGCCTCOTACAAAATACrcTA 

TCATTAATGTiUW^CAAACACAGTCTATCCATC^^ 

AATAACTGAGCTCATITGATCTQGTTCTTCCTTAAGCC^ 

TTTOtfyW^ICCGGAAAGAAAAAGCGATGATAATC^ 

AAGAACAAATTOnTCGCAGCCTACTCTO^^ 

AGCCTTCCCACCTTTGGAAGCTa::ATTAATT^^ 

GGCAGGAAAGAGTTTTAATGCAGGCTOCAAAGGTCKOTAATTT^^ 

AGCAATCTAGCAGTTCAAACTAGACAGCTGGTTCTGGT^ 

CTCTCrOCmXXOTGGACAAGC^^ 

ATTTQGTITTGATATTTCCATCATQQCCX:^^ 

ATCAACAAGGCATATTGCXIATGACTCTAGGAATCAC^ 

TAGCAirACTGTTTTCCTGAATGTAGGAGTATTTCXSAAAAGG^^ 

GTGTTITGTTTTCITrcTGTTrGGATrAA^ 

ATGTCnCIGGO^CCACTTCAGACAGTAAGGATATGGATAT^^ 

GAGAGAAAGATTAAAACATATGCAAATAATTITGATAGAAGCAAAGACCTAAC^^ 

GAATOCTITCATrcTAATGTTTTAAAGCTITTTATT^^ 

TAGATCCAAATATGOTAGTACTGTGAAGArTATGGTTI^^ 

ATTTTCATAGTGAAAATGGAGATAAGACATXSTCXCAT^^ 

AAACTGTTATAGGAATGATTCACATCGAGCnTCATCTCTA^ 

GACACGAAATATAGTGACAGCCTCTCGCCTQGGAff 

GATAAGATICAAGACCCAAAACTATCAATn«3AAAGC^^ 

ACCCAGCGGGGCTIXATrcTTCACTGCCCCCCTC^^ 

AGAGAAAGGGA<^X^•AAAGG^m3AGGATTCX:AlXXXrmA^^ 

TCATACCTTCTCGAAGGCAACTAGGTQGATrACAGAATTGTGC^^ 

GAGCTTCACIGTCXIAATATQGTAGCCACGTtnXS^ 

AATAGCAGCCACATTICAAATCTTCTCTAGCCAGATG^ 

TTCTACCATAGTAGAATGCTCTTTTGGATAGATCTGC^ 

GTTCCGAGAGAAAACATTATAAACTTCTGTTIATCCATTAA^ 

GGGTCGCAGGTACTGTTTrAAGTGTTTTGCATAT^ 

GTTGTCCATATTICACAGATCAGGAAAGAGATGTGC^^ 

TTTCAGGGTCCX^VTACTTITCAATGTAAGCATATn^ 

GCACrxyOOVGGTACCCTCrcCAGTCCTCTGATGGT^^ 

GCCATaTCCCCJ«5GGAGAGGGTCCATGTTC5G^ 

ATXnxnXSCITAGTTGGATCXyiTTGAGAC^ 

TTACNAACATITCTGCAAAAAGATCAGTGTTTOSAT^^ 

TACTTCACCTACTCCTAATAAAAGCCCTTCATCT^^ 

GAAAAACAAAAATCTAAATTAGCAAAGAGAAAirrGAAATTGCATCXrATTAa^^ 

TACCATTCCTGGAGGTAGTAGCTAATAGAATXSATCXIIAAGTTTATAAT^^ 

CCCTTAATTTITCrAACTGATGAAAGCGTACTTC^ 



aggatatcx:tttcaatagaggtgaaaatcctgaaataaaaggttt^ 
ovtggcatagatcxrrgaactittgatckks^^ 

TCGCCTCXXACCACCCCTrAATTTAAAGGGATTTA^^ 
CCTAGTTGATGGAATrcTGAAAAAAAAAAAAAftGATAGAATGTTTTCa^ 



TGTGCGCGCGCCTQCATGTGTACTGTGGATGTGA^ 
GAACCTTTTCTATTAATCTTCTTGGGGCT^^ 
TACCCAAACSATITCGTAACTITCrcACATC^^ 
TTTCATACAOOTTCAAGGACTTAAGAAAT^^ 

TQGGGaATAGGTGGATATrATACATTTATTTAGGTGGAATGGTGGGTC^^ 

CACAGaXTCATTAOTOAACKrAGAGGTTC^^ 

TTATTATTTTTTCAGACAGAGTCTTGCTC^ 

GTOSATTCTCXriTCATCGGCCTCaSTACT^ 

AGTTAGACACCCSGGTTTCACCATGTTAGCCATUrr^^ 

GAAAATCXTTCGGATGftCJ^GGCGTGAGCTGCCT^^ 

TIXaATAClTACTCGGCXXTGTCXrrAAGTGCI^^ 

AATTATCCTGAmtXCAGGCAGACGAAACr^^ 

TCGCAGAGCACTTATTCGCGCTAAGACTGTnxrr^^ 

TTTCATmrcrrGGGAAGGATCTATCrrCA^ 

ATGGTACAATCriGTTAAAAGGCATGAAAAAGAAACyVGTCATTC^ 

CACTGAGATCTATXXy^TGTTGTCGGCCCACrr^ 

AGACATGAAGGGCXriCITCAACCTACCTTTTC^^ 

TCrrcGATATTCATCAGTGTGTTAGACTGCTr 

TTTTGATTTGATCnTGAAGGTCATTTGAAAAATACTC^ 

TGTCACirrACTrGGAAGAGACGGGTCrrcAGGGAG^ 

CX5CTGACATGGAAGATITTTTTTnTO 




^ACTCTGTIT'ATCCTCTGCCTACTAGA 



CAAGAATCXrrATCW7mTOCTCAGGGCAA( 




JTGTGTGTGTGTGTGTGTG 
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AGrnCTTAATGGTCTTTAQCAAAGGAAGTMGTGACGC^^ 

QCTAGCTCyVTIXXrCAACAGAACCGGCCTTKX^ 

ACCCXXriTCACATACTTATACTGTC3GGCia:AT^ 

AArAATGTGTGGCX:AATCTCAAAAGATACTAATICAT^^ 

ACCTTTGGAAATCAAGGCTCTTX^IXXnTTAT^ 

CAGCATCATGATCX:AAGCATTGACTTTGCrc^ 

GCTGTGACAGAACACGTGGCCCCTACSriGTGTC^ 

GACATAATTTITGTTTTCTGTTAAAGGGTCAGT^^ 

CTTTXWVGGAATCAAAATTAACTTGAATACATTGTCl^^ 

TAGGAGCTGCTCTTAATGATTATTACTCCATAGGCTGGAATAGA^^ 

CriTAGGCCTlXXXATGTATrAAAGTACTATACATITC 

TTrCTTCACAGATmriTITCCAAGrrATA^ 

TTTGAATTGGTCXSATATACTCrrarAACTrcTTTAAGA 

GATTAATCATTACAGTAAGACTTATTAGCIGCATTrATAT^^ 

GTATACTTAAGCAGAAGTTGGGTTTTAOTITACAATTT';^^ 

GTTGAGTGGATGTGGGGCTlTAAAaXAGGAGTCICTAA^ 

TCTCCCAAAATGAAATACAGGAAGTGGACCAAATT^^ 

AGTCGAGGTTCTGGAAGATGGGTGGGGTCGGGGGAAQCGAATGTGAAGTGC^ 

TTACATCAAGGCTCXX3Aa3AGGTCTTICCACTGff 

CACGCTCTGCTCTGCCCTGACTTCCXSATC^^ 

TGAAACTCrCTAGGATGAATCmXMAAGTCXrrT^^ 

GGTTTAQGTTTACATGTACTTTAAAAAAAAAAATTAATAAAGTTTATT^^ 

AGTGGGAAGTGTAGAGTTTCATATACTCCCTGTCCTCAC^ 

CTACATGTITCATTGAAAATGATCTTTTGGGTGGTT^ 

TTTTGACrcCCTAAGGGACCCTGTTTTGGG'l"^ 

GGAACCAAACTCTCTTTATTCCTCCCGTTTGGT^^ 

CCCCTTTCAAACGTTATTCCACAACAT?^AAATrAAa^ 

ACAGCACGCATTTGTTCAGCTCTCntST^^ 

TGCTIOT3ATTTGGGGCAGCACCrrr^^ 

AGGATCCCTCTCCCCAGCATGGGCOTICATACAI^ 

TTTTGCGAGCXriTATTCAGAATCnCAGCATC^ 

ATACGrrATTTATGGTGTTTAATTAriATTGTACAGATTTATT 

AAAAGAGTTTGCTAACTTTGAATCGTAGATCTITTrATCTGAAA;^^ 



AACCCKnriTCrAGTATTTGCTCATTACTGT^^ 

TGGAGTTGGAAAGAGATGaTrAGTCCCATACCAOTITATGT^ 

TAACCTCATGAGCATAGATAATAGTCACATCTACTAAGATTAT^^ 

AATATAGCTTATTGAATCATACATTTATCI7«riTrAATA^ 

TGATAGTITAGCAGTGlXXrrCTCTTAAGCCAGr^^ 

TITTGTTTTGTIlTCTrTTGTTrro 

GTGGTGTGATX7rCCGCnx::ACTGCAGCC^^ 

TTACAGGTGCATGCCACCATGCCCAGCTXaATTITrcTAT^^ 

CTTGAACTCCTG?\C1TCGTGATCCA£^ 

CATTCyrcTGICAQGTTTAAATTGCACCTTGQGTT^^ 

GGGCATCCCltXytfMTTTTCCGCTGGGGTSTGATGAGC^^ 

CATCM*ACa:TGCTTGGTACATGCXAGATCATT^^ 

GAGATCATTCTATAACAACAGCATGGTGACATCAGAOT 

TTATCTTTTTKrrXiZAGTTITTl^^ 

TTTCCCTGAATGACTAAAGCAGAGATTTGGITATI^^ 

GCATTATTiaXXZAATrCCGTATIXCATCTACrrAATA 

aSAGTCTOTATTAATOTTCCGTCTTTG^ 

GGCTCCCCTGTTATC?rAlTroGGACTTACTGC^^ 

GGAAGGTGGGTCGTITCTACTGGGCATCTGrcCTGTATC^^ 

TACCATCTCTCCCAGTAGGGATAATAAAAACATCTCTCC^ 

CCCTGGTTGGftAACTAGTGTTCCAGAGCTACATra 

cttxxaggaaccai:x:aatcccccaacactaggat^^ 

aagctggactgcagaaagaaaagcagtatttcrraccc^^ 

ggatncctcmaaatggggcttctaaaataagtgg^^ 

gcctattttcrigagcigtcgagcagggaga^ 

ctgctaoctttcxxngcsggatggagggaacatgg 

aagttggcatagtcttagcagttagtcxragaigtg^ 

tgaaataggctcagtctagcttatgtctaggtggctgagr^^ 

attgtgtaaaacctqcacagtggtacacctgagccc^^ 

ccctctttaaggatttf^gactttctgatt^^ 

TrTGGCCCTACCTGGftQCCnGAGTCCTCMOT 

CTCTCAGTCATTAGGCGTCATCTGAGACAGTGGCTGTA^ 

TCTTCATGGGCCATTAGGAACTGTGTGACGCTGGGGCA^ 

ATAGGTCTAATGATATCATTATGGCAGATCAGTGCXSACXSATTC^^ 

QGCTTTCJCTGGGCCACATCTCTrGTC^^ 

AGGCTGCCTTGTTOXTOCTCCATGTAGAACACAGA^^ 
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CCCTTTCAGAAACACTTCAGGAGATCAATACnTAT^ 

CAGATCAATAGTGAAAGCAGGTTCTGTTAAGTrrmAT^ 

GAAGTGAGTCTAACCAGGATGATAATGAAGMTITAGGT^^ 

TAAGAACAGATCTITCTCTCCAGACCXXXrAGCmy^ 

TTTTTXXnCTCriTrACTTGGa:AAAT^ 

CJAAATGTCTTTrGaGAGAftAACAGCftGGGftCAI^^ 

GTTTTCATTCTCCXCTrAGCAGCTTTC^ 

ACAGGGCTTCTATTICCAGAACCTTCTAACTCrATO^ 

ATGGAAATCSGCTCnOXXKXXXrrcAGCT^ 

GGGGAGTTTGGAAACCCTCCTAACATCmyVTCTGAAAT^^ 

CTGATAAGCAAAGGCnGTATCACTGGTATGGGTAACTGlTCTGG^^ 

AATXnTTATAlTOCACTCXXXn'AAGCTCTAACAAAGA^ 

TrcAAACTGCXaATTTACATTCTICTTAAAATACXS^^ 

GAGACAAaTrCTAGTTCTQTTTATTrTTAAC^^ 

TGTGGTCGCGTIGGCTITrACTrcGGGTT^^ 

GGTCCX:ArcATCATCAACCAAGCACX:AACAA£^^ 

ACACACAAATAACTCAGCGirrAGTAACTTTCACOTAC^ 

GTGC^ATCTGlTTATGCAfSTTACCTITGT^^ 

TGATCArrCTTCn'AGlTOXGCAAATCrrATTrACGGT^^ 

CTAGTTTCWSACTTCTGGCTTGTTGTGGGAT^^ 

AACAAAOSATCCltnGATTGCTCAGATGTCC^^ 

GTTCTCCXjTCrTGCAGCAAACTGCATrcACTCC^^ 

ACCXAGCAAACGTGQCOTCTCTTCCXXAGGC^^ 

TCACCATTTGCTCTACTGCAAAACAGGACTCATTAGCAC^^ 

AGCATCACGGGCAACTCGGCTTGGCTC^^ 

TCrrCCCTCTTCGCCAACTTCATCX:TTC^ 

CTCCCCGAGGCTGCCOGCCATGCTCGTGCTTTXn^ 

CCTCATATCTTCCACAAAGCCTTCCCAGACCTT^^ 

CCATCAGTGCTTTTCTATGGTGACTGTCTCX; 

CAAGTTTTATTTCCTCnTrAACCTTGGCCTC^ 

TTICCCCACACroATtnxn'ATTTAAGCTCCA<n^ 

riXXTTOTCACXrCAGGCrroGWGTGCACTI^^ 

TACCTCAGCCTCXTGAGTAGCTGGGACTACAGTTGTQC^^ 

ATAAGAGGGTGCATTOlXrrACCTGTGGGAGTGATT^^ 

TATTTGTAAGCTGGGAGATGGGAGCTGAATTTGAAACCXX2^^ 

ITTGCAGAGGAGGGGGGCACAGATGCCICCAGTQGCCCAAGACT^ 

CTCTTTTCATTACCCTrATCAAGATTCTGCAGCC^^ 

GTCCAGGTGCATTTTAAGCATGGAAGGTGACyUTrT^ 

TI>3GAGTTAGTCrnrn3GGrmTa5AAA 

AACCTGGAAGATGAGTCGGGTTTCCACACftCAGGGAA^ 

GGATGAAGCACAAAAOrrcAGCCACAATCXrAGTATA^ 

ACTGTATAATTrrTTTCriTCTTT^ 

CATTTGTGACTCTAGCAATTAAGCTTCCNri'l"^ 

TCAGCATCCAATCCACATIOSACTCmTCCATG^ 

GCAGGAGACAGTCCCOCACCXriOXXSACTA^ 

CTTTTCAAAGAAGACTGAATTTATGATGAAACTTTCT^^ 

TCrGGATAATTCTt:AATGWriTICAAA<OT'AT^ 

GTGTGTTTGTTTTCCAa:ATTrGTAGCXT^ 

CTTGAGCCTGGAGACGGCAAGTACCCAGTCyrGCATGCATGa^^ 

AATCAGTCATCXTTTAGCATACTACrrcCAAGTAGT^ 

CCGGTAAAGTGACGTTTTGATCAAAACAGCATGGAT^^ 

AGTTACTTAATGGAATAATACATCnTrCCTQC^^ 

AGACACATACAAAAATCnCCTATGTTAAGAATCTTTATTTAA^ 

TCGAAAGAAAGTTCACACTGCAAGATCGACAGGCT^ 

TGGGAACATATCAGTCACATCAAQCTAGGTCCACCTGTGTGAGT 

CAGAAGAGCAATTGCATTTQGTCATTCTAGTATCAACTGAG^^ 

rrTCTIWrACAAGTTCTGATTAATGACACX;Gav^^ 

TATATGTATAGAAGCGTATAATCrACAGCACTCCTT^^ 

CATTGTAGAGATGAGAAAACTCAQGCATACAGAAGGGATGGAACTAa 

CCTGTAGAGTCCGTGTTCTTACCCACTGCATGAT^^ 

ACACTTCCCTACTAGGTAGAGCAATGATTCCAGAAGTT^^ 

TAAAGACATTCCTGAGACTGGGCAATn'ATAAAGAAGGAAGGlTrAGlTC^^ 

TGCCTCTCGGGAGGCXrrcAGGAAACTrACAGriTATGGa^^ 

ACTGGAGGAAGAGAGAAAGTAGGGAQGTGGTAGATGCmTAAACAAAC^^ 

ACTAGGGGGATGGTGATTAGAAAGCflCCTCATGATCiXGTm'ACl^^ 

ACAccrnxnGAGGTcccTcercc^^ 

ACTGGAAGGTAAAAAAGTTGGa^ACX3«3AGGCTTAGA^ 
GATGGTTTAATAAATAAGATTGATACAAACACAAAGCATrAAGCAAAC^^ 
TCTGTOnrmXWUVTTlTCTTTT^ 
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gaaccccacy^caggtaccttggaaagactgcatcrcc^^ 
tttttrcaggaaaataaatttaaggcttggttatcgc^^ 
cx:caat(?k;tatggtatggcttttcctgi7iat^ 
ggcigatxsttcatcgcacagtgacagcaiat^^ 

GGTACTTICTroCAACCTAGTTCAOrGCCAT^^ 

ccctcggacatgftcxtractgcaaagtgaggatgtttcat^ 
atctgtgtxxrtotaagaaaagatatagaaagattagtagtagactc 
gatagactcttocctgtcgcxx:aggctgcagtgc^ 
CGATTGTOntxxrrcAGCxriTcri^ 

TATATACGGGGTTTCACCATGTTGGCCAGGCrcGT^^ 
AGTGCTGGGATTATAGGCCTXSAGCCACTGCACCTGGCrACT 
TTTGGCTAGGAAGTCATAGAAAATGCrrc^ITI^^ 
GTGGTGTCATCTCy«XriCACTGCAACCTC^ 

actacaggccaaaxx::acxx5cacctggctaattt^^ 

tctcgaactxxtcsacctcaaatgattccc^^ 

gcx:aggaagatggtcatgttitactttcaaqcaatc ig 'i 

ATTTTCAGTTACTAGTTAGGTAAAGAATCAAAATTAAT^ 

TCTTTAAATK^TAACCACAGTCnTCTAAACACC^^ 

GAATTTTOGGRTGATTAGACTTGTATTCTTTTAGGCT 

TTCCX:CATACACCACXnTOITrTCmCAGATT^^ 

TGAAACACGACITCTGTXXTCAGAAGGCAGCAGTCT^ 

TCAAGCATATGTTTGCAGTCATCACTGATGTTCTGCC^^ 

GTAGAAAGATCACTGCAGATGTCCTTGTCGTtnrGJ^ 

TTAAGGGAGGAATCACACGTCITAGCACGGGGTTGGCAI^^ 

CAAAGGTGTGTXnACCATlCTATGACCATOSAATCT^ 

TTCTTTAGTGaVTAAGATCTTCAAGGGAGGGATGCATTOT 

CCCTTTAGACAACATCACTGACJ^ACTC 

TGTQCCCA(X3GAAATGGGAAGTGCATTCAGGAAGAGGCCCC;^ 

TTTGCTGTC3CXATCTCX:ATCGAAGGAGAGGAATGAATC^^ 

CTCGCCCTCTCAAGCCATCXAGAAGGAGAATGAAATGCTCA^ 

GAAACTTTOSAGGAAGGCMAGTCAGAAATTATTCT^^ 

ATTGTATCCCTTITXXriTrGGCATAAAACAACAGAT^ 

TGTTTGGAAAGACTCAATAGCATATGCACATCAGGAAGT^^ 

ATICTAGCTWrCCrcCAAriTTATTCAGAATTTGT^^ 

ATTCGACSACTTAAACATAAGATCTGACAGTCTAAAATTAGT^ 

CACAATGAGTGTTTl'lOTATATGAACCGAAAAACAGGTAGCA^ 

AAACTTCTGTACAGAAAGGAAGCAATGAACAGAGTGAAGCAGCATCC 

ATGTGATAAGGGGTTAATAriCAAAATGTACA^ 

ACATCTAGCATTATTGTGTGTTGAATAAATACTTGTT^^ 

GGACTCTCAGCAGTACATTACAGGGACAGTGGTTGGGTC^^ 

TT1TATCATAAATAAATCAATAAATQGCAGATTATACA(XGTAG1^ 

TTTTTAAGGGGAAriTGGTATTTTTAACAAAGCATCA^ 

GGGAGGCCGAGGCGGGCGGATCACAAGATCGCXSAGATCGAGAC^ 

AAATACAAAAAGAAATTAGCTGTGCGri«3TGGCGGGCG<XTOT 

OVTGAACACGGGAGGCnXXSAGCTTCXywn^ 

ATCTCAAAAAAAAAAAAAAAAAGAAAAGAAAAAAAAGCATCAGGGTTGTC^^ 

GTTAGACTICTTAAAGCGTATTTTGTGGTITATITTATT^^ 

CTTCATTTCACAGTTGATAGACCAGGC^ 

TTTTCTAAACCACACATGGCCTGGTCACAAATT^ 

GATGAGGAGrn<3AAGGTGACAGAGACTGAGiW^^^ 

TGGGAGATGTrCATGGCAAGAAAAATTCi«::AAGGGAC^ 

CmmXXXaTICCCCATTCTCTCrrTTCll^^ 

GCATGATSGCATGATCATAmTCACTATAAlCIXXA^ 

CCTAGGAATACAGGCAGGAAGCGACTGTTACTGGTCTGTT^ 

TCCTITATTGGCATTGAGAGATAAGCACAATAAT^ 

GQCTTTTrcTt?IATGAAGTACACGAGGTTTCT^ 

GTTGAGTCAATAATTAACTGAGTTTGATTAAAiXTG^ 

GTCATGCACGGTGATAGTGCGTATCTAACACTTGAAGGr^^ 

GGGGGTGAGGGGGCGGAGGTGGTGGAAAGGCAGCAAQCTACGATGTT^^ 

AGTCTCATAGTCTTATTIXncmTGGCTTXS^ 
GTGGGACTCATGGAGATTTTCCGTTCrriOGAGACTA^ 



CATCTGGTAGTTGTTIGTGCCTOIXJACTCT^ 
TTTTAATXrrxn'AGCTCCAGTATCCAGCCCAGGJ^^ 

aaatcgaggcx:aaggcactggaagtcccatt(5tct^^ 

CATOXXSTerTTAATGTGCTATAAATAGACATTCC^^ 
CnTC TGAGQCCTCACTGAGACTCATTTTG^ 
GrnCTGCCTGTCGGAAAATACAAAACAACACAACAGCAA 
GTCTGCXATOTTAATGCATGAGCTCATTTrGAGCCAA^^ 
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OVGTACTAATTCTGTGTCrrGACTTrcnXX:^^ 

CATTOXTTCTCXnTCCTAAGTCTCCATTCCG^ 

GCACAAAATACAfSCCAACATTTTTGCATTTC^^ 

CXXTTCATCTATCTTTOITCCGCTIT^ 

CATITrACX:ATGAGAAGa:ATAGCAAGATGTGCCAGGr^^ 

CCTATGACTGAGGGTAGQGTAGGTAGATCACTCATTACAAT^ 

ATTCCAAGCAGTCTTCCAGCTGGACTAGCTTT^^ 

ATTCGAAAGCTITTTTCTATGATGAATTATTCCAGa 

CATGTACAACAGCGCXrn?IXXriXX3GCCI<^ 

agtgtgtctctttcictogaaggccaagtgatxk:^ 
agccagggaaatatxnricactttgctggggtgcmgt^^ 
tggqcagggcagagggtcagctcwvtggcagggairecca^^ 
aataccxricactgcaatacatgttttictit^^ 

AL"in'l^*iMGGAGCAGCACCTGACATGTICTAAAAACACAAAATAAA 
ATATCTTTCTTTGTGTAAATCITITOCXIATTCT^ 

ccGGcrixxAGawnx;TcrrAGGTATcnxn<M 

TAAAGTICCAACCTTTITOCnXXrTCTC^^ 

TCTTmCTTTCITTTTTTTTTTTGA^ 

ACTGCAACClCnyiCrcCTGGCTTCATGCCAa^^ 

CATGCTTGGGTAATTTTTCSTACTrTTAT^^ 

AAGTGATCCACCCACCTCAGCXriXXCMACTGCTG^^ 

AATCAGGCTCCATGGTCXSGGAAGGATGTTGCXKTGGGGT^^ 

GCCCA1<X^CAGATTTCGGTTCCTCTGCAG^K;TCAAC^ 

GCAAAAGTAAGAATTGTATCXSTAGCCTGACTCAGGGTTTTAGCC^^ 

ACriTGTCTTCXrCTCCXIAGGCACACATC 

TITCTTTAATAGTTTTATTCATATATAATGTGCCT^^ 

GTGTATTCAGAGTITCXZATCCAIXATCACAATCAGTTTTAG 

(^ACCTCATATAAATAGAGTCATATGTTAGTCGTCTCT^ 

TATCCCTCXX3ACAGCATCTACCAGriCATCATTT^^ 

TGTGTATCCATICATrAGTTCATOXSATTTT^^ 

TTTCAGCATGGACTTCTCTTTTCAITC^^ 

GTClTAAAATCOT<nGCATCAGAAAAAAGTCATCATT^ 

TGATrrrrrTCATGCTCTACITACAGTATGACT^ 

ATOXXSTTTTC^ 

CTGCGATCITrQATTACTACAC7«XriT0GG^ 

TATITCAGCCTTCTAACAACATTGCCITICATATGTTGCT^ 

GTGTGTCrcGATTAAGAjGAACCTGAGAATC^ 

TGCATGCTGAGflGCATGATOSGAAGATQGAGTCTTCCATCCQC^ 

TGAACTGAAAATAAGGTAACAGACCCTCAGGTCTACCTAT^ 

AATAATACATATGCTGAAATTCCTATATCCXXrCCCATACTCC^^ 

lVWW\AAAAAAAATTTTIWrcATCAAQGTGATT^^ 

TCTTTAirrrCTCTCTAAAAGCATAATrcATCTGCCAT^ 

CACCyVirAGAAATAAGATTTCTTCTTAAAAATATTCAC^ 

CAGGGGQCATTGTGGTTTGACTGTCa^ 

ATTlXnXSATTCCTTCCAAGAAGGATTCATAlC^ 

GTlXnCAAATTIGCrAAACTTAGCCAGTTGAAGATT;^^ 

AGTAAATCACATTTTTATGCTTTCTCTGAACT^^ 

AATAACATTCAAATACATCTrAAAGATGTCAgrCAC^^ 

TGATTAATTCTCCTATTAAGTTTTAATGACTT^^ 

ACTGAAATACCAGGTOTATAGAAATGOVTCIXnGG^^^ 

TTTGGCAAGTCTAGAGGGTAAAGTATCXXrrcAAQGGGGAAAAa 

AAAGAAATTCXXAGGAGAAAGGTACTTTCTGAGATTCACATA^ 

GCTTGMCTCTCA^ 

GTTTTCTITCCAGGGTOXTrATGGCATGG^ 

GAAGTGGGCGTOTraATGCCAGTCmiriTC^ 

TCGGTTTATIXriGAACACATGacrcT^^ 

TGGTCATGGGGCTTC3GAACCACGCCACGGACGCATGAAGC 

GCTCAGGGCaCTCTGTGGTAGGCJUVCyv^^ 

AAACCAAGOOIACATTAATTAATATAAATAATGACTQQGAAACGT^ 
GTGi-i'iUU<nATftTQCTTATGTATATGCCMnUVIUCAOGAAAGGA(^ 

ctctaagggatattattgttcctcaatmr^^ 
cccactcttcagaacttcaaatctaaagatctttt;^ 

AATTCTAATCXrrCAAGCCTCTAGT'AATGCCnX^^ 

TGAGTTAACACACTTQOMGACTCCXaAGGCAGOT^ 

AAGTTCXSGGTCXlACCAGGCXXrKnxy^GT^^ 

TCCCTGAAAATCATCTTACTTTTTTAACGCTO 

TGGATACCTTAGACTOTXTGGGCTCT 

CCTCACCAAAAAATCAGGCTTCTGGCTI^^ 

CAGTCCTGGCCATXXXrrcrCTGCTCTCCCT 
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OCAGGAAGAGGCAACTAGITCCAQCTGGOCCACATACTACACTITAAAAA^ 
GAGTGGGTGGAGAGGACTCX::AGGGCCATGGACX5AAGATTCGa 
GAGAAGGACarGGTTGGCTCTGCAAACCGCATIT^^ 
ACCATTCCCCGTCAGCCCCTACCCGACCCTTX^ 

CTCCTCGCTTGGCCAGCAGGGGCTGCCTGGAGAAGGGGAGGACAGGC^^ 

TGAAAAGACTXXIACCAGAAATCGGAOGCAGCCCTCXyWCATAACT^^ 

GTGTTCTCCTGGCCAAGAGTCXIAAGCGTGTTTn^^ 

GGTCAGTGACITAGCCACTCTATAAAAATGAAGAGCAGAGC^^ 

AARAAAAACGAAAGAGAGCCTCAQCATTCCTTTGAYTCCCTSCAC^^ 

Ta:XX:CCACTCACATOTTTATGGCCCQQTGAGAC^^ 

CAGAATCCCCTAAGTCGGAAGAGCTTTOGAGTGGAGGCTGT^^ 

CTQGTCATTQCTATACTTGAti' l ' l U lUUU CAGGCTC^ 

OtXXrrCCAGGTTTAAGTGAlTCTCCI^^ 

ATrTTTTGTATTTTTTAGTAGAGACGQGATTTCACCATGTT^ 

CCCGCeiCAGCTGCCXyVQGGTGCTQGGATCACAGGC^^ 

GTrGTCATAAACCCTGGAGGAGAAGCAAGATCTTTCTAGCAGGAG^^ 

GTCnrCTICTATTCArTCAACCAACAGAGAACAAATGAAAAG^ 

ICTGTGTGCTGACTATATTTAAAGACAAGATTACXAGTAAT^^ 

CCATAGATTGGACTTCACATTCAAAATGTAAATTGT^ 

GATAAAAATAAAAAGTCAAAAGGAAACAAACAATGTTOCTGAGAAGTGATAC^ 

TACAGCTCTGGCATrKrroGGGAACAOTTAGA^ 

TTTOSAAAATACACGCAGAGAGACCCACTGCATGTC^^ 

TCTICACACACATATCACCCCCATACTCGA^ 

CTCTIOSCATGCGTTCTTACCTGTTGAGGATIT^^ 

GTGTCXrrcAGCGCAGAGCCATTCTCAGCATCCT^^ 

AGACAAGCTIXnCCAAYCCATGGCCCCCGGGTCGCATCT^^ 

AACTITCTTAGAACATCAGGATTTTTATQCACGGA^ 

TGTGTGTGTGTGTGTGTGTGTGTGTGTCTGTGT^ 

TTAGCTCACTGCATCCTCTGCCTCCCAGCn^^ 

CCCACCACCAAGClTIKXrrAATTTTIGTAT^^ 

CTCXrrcATCTCAGGTGATCCTCCTGCCn^ 

TTTTTITITTTTTTTTTTT^^ 

GACAATTCTTCCTCCACTGTGGCCCAGGGAAG^ 

a^AGACGTAGCTAATTGCTCTGCAGAACAGGGTTGGCCArc 

AGTCCCTATTCTTAGGTTTTATTTCTCAAACCAGGTT^ 

TGCTCATTITIGATAAGTAGTTGAAAGCTGATGGTGCCC^ 

AAAGAGTrGGGAAGCAGAGCGTGAATGCCGGGATCTOCAGATCTGAGCAOT 

CYKATTCrGGTGGCTOSGGGGTGCCCCNT^^ 

ACCTACCyUCTCTTTTCyWXSAATGCJ^ 

AAAAGTATCCTCTCTAGGCAAATAACXSAGATTiaxrCAGCAGGATCT^^ 

CACGTATGTGTGTGTGCGTGCACACCTGCCTGCanTG^ 

CGTGTAGAACAGGACAGGACAATGACAAAGTTATICAATTAATA^ 

ATCaiAGOCenTCGGAQGCCAAGGCAGGCAGATCAC^ 

CCCTGTCTCTACTAAAAAATACARAAGTTAGCOGAGCGTGAT^^ 

GCAGGAACTCXXriTCAACXXrGGGAGGCGGAGGCT^^ 

GOSAGACTCJVGTCTCAAAAAAAAAAAAAATAATATAATAATAATAAT^ 

CGCATGTGACTATTAGCTCnGCTTTGTGCGGTTATCTACGTC^ 

CCAGTGTGCAATGCTTCCCAGCXXrnXATTCAGGGATAa^^ 

ACCACGGGAAACAGGCAAAAACTACTGAlCAAGCCnTCAyTG^ 

TGTTAATAATGCAGATTAAATTGAAGCGTACAGTGCCTAT^^ 

CXnXSTGGCTCACAAATTCAGCACTTTCGGAGGC^^ 

CAACATAGTGAGACCTCATCnCTTTAAAAAAAAAAAAAAARAAAAAAGAAAA^ 

AGTCCCAGCITCTGGGGATCayiGCAGCXSAGCC^ 

TCTOy^AGTCTGTCTGTCIXnxrro^ 

TTATGAGTCACTTATCCCXnTGAAGAGTGAGTGATGTATCAAAATAT^^ 

TTITTGGAGATGGAGTCTroCTCTGTTGA 

GAGTTCAACTGATTXnCCTGCCTCaX:TTC^^ 

TATTTTTAGTAGAGATGGAGTTTCACCCTTTTGAT^^ 

CTGCXrrcACCnCCCiW^AGTGCTGGGATTAC^^ 

gtatcaagcaatrcttacatxn*aactgcactttct^ 

caaatccatgaaagcattgagtgagaatcaattggct^ 

aattxnxstgcyacagatccitcgtatcagtaaaaict 

tggcaggagttnttgttaaacatttaccacagcagtogatt^ 

tcx:atgacctgtcxwvtgcacctgcttc 

gtgttgtticcatctgatatcaccattgaggggagcaagatgac^ 

TGGOVTAGTOGQCTGCTCAGCGTGCCTGGTTCATACT^^ 
TTATGATTAGCAAATTGCAGGTCXrrcCCOTITC^ 

ccaaraarmsgstixstcattaatgtaaaagcagctxsgit;^^ 

TAGTCACCATCICTTrimnTCXGTCAAGCC^^ 
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TGACGAAGGATTIXyVAATATTTTGAGAACTCTGAAATAATC^ 

GATGACATirrGCAATCCCATXXAOGTTXXyWS^ 

(XCCCAAGCXnxnrGCCTAAGAAATQCCCAGGCCC^ 

ATCTCOS^TATATlTCriTGTTATl^ 

AAGTOTCAATAICTCTAGTACCCCCCGCCTAGGGTC^^ 

TITGATGGCAAATTGATTrcAGGTTCATC^^ 

TTTTGTCAGAAGGGATCYGCTGGAATGAGACARAGATGTCT^ 

AlXXXnTTACIXXTTAAAATACCCAGCrACTTCn^ 

TCCACTTTACATATGAAAAARAAGran'ATCAAAClTO^^ 

AjCJ\CAAGTTATICCATCCAAGArrCIV\AAC^ 

AAGATTCnGTGATCAAGTATAGCAGCCCTGTG Ga v l ' i UCCTCA 

AlXKaAGAAAAAACrrCXXGATATTTAATATrATCAATTGi^^ 

CAATTACTCTTATGGATTCTTCATGCAGAAAT^ 

CAATITGTAAGAGGTTTGAGGGAAAAATAATACIX3AGTT^^ 

WAAACGTSTTCACC^!MAAAACCCATT1TATCXCCTTT^ 

GAATGCATGATTTAGAAAATGGAAATTTTTIAAAAAAAGC^ 

AATAGGACrnGAGTCCCCTACTATGTCXXXSAGCCTT;^ 

TTCGCAGGTTTCATGTTATTATCn'ATATGTGGTAAATAAT^^ 

AAGCTTGACKXX:TTGTAAATCAATCAAGGriTACTC 

TCCCTTTQCTOXGAAT(X:AGCCXnCC^ 

KCAGGCAGTCCCAGGCAGGAAGGCCCATTAGTGAAAAGAGCCCTCX^ 

CATTTAGGAGATATTTGTGGACCTTTCATTAQGACAA^^ 

TAATGAACATGTGCCTITGGAGATTCCAACTQGTTTGTATTTI^ 

GATAGCATTAAAACAAATGCCTCAAATCCATTrrTGGAA 

ACTTCAAAGATQCTAAATTATCTCrrTCTATGTAAGTAGGAGGC^^ 

ITCTCAAGAGAGGGTATGGGCAACATTTGATAAAAGATGTAATTATTGGG^ 

CCACTGCCCTQGTGGGGGAATCAGTCy^GGCTAGGAATCACTGGAA^ 

tttgctttagcttttcaagtatcttac^^ 

aagttaaccltitggtgaaggaatgatccasgggaaatagtaaaatatragc^^ 
tccx:actgatacttacaaatatacagaggtggtgtaaaaggat^^ 
citatcagaganttgtatgtaaatgtct^ 
ccacttaccatccacctgcatactttacacrtci^^ 

TAAAITIATArrACAGTTCXXIACrcCCCCCSCCCXX^ 

GGAAACTCTKXXrTTGCAGGTGTCTGTTACAAGT^^ 

TCTTTATATAGTATTTATTGAGAAACCTATTCTGTTAAT^ 

AAGCATTTAGAAAGACTTTCTCTATAGAGGCTTCAGA^^ 

TAGTCTATCnTATAGAACATTCTAAAAAGAQCAGATTTTAATATt^^ 

TAATTAGATAATTATGTATAATAAGCACAlTAAaTATGTGCAAaWXrr^ 

AAACAGCTTGTGAGTTCACTATTAGGTCCAIGAAATTTT^ 

ATAGATACATTTICCATGTGTGGAGGTAAAAGCAGTGCA^ 

CGGCCCTGGTACACTTAGATAGGCATCATTGCCTITCCrcT^ 

TGTACCAGATTiaTITACCTGTCCCAGGGCAAGATAGCAAGAG 

AAAAGGCTTCXrAATTTATrATTAAGTGGAATAAGCAGTTTAAAAACA^ 

ACACAGGCATAGAAAATAGATATACATAAATAATGTTACTATAATITGTC^^ 

ATTATCTATTATGTOCCACTCACTATTTGAGTGCTGGGAATA^ 

GATGGCCCAlXXnCTGTGTTTTCAGAATCACAKGC^^ 

AGGCAGGTGGAICACCnGAAGTCANGGACATCAAGACCAG^ 

ATACAAAAAAAAAATTAGCTCAACCTGGTGGTTTmCGOCr^ 

CTTGAACX:CGGGAGGCXX5AGGTTACAGTGAACCGAGAT0^ 

GTCTCAGAAACAGAAAATAAAACT^AACGAAGAGATAGTATr^^ 

CTAATGAGAAAAAAATAGATCTCGTGTGTGGAGGGAATTGAAGATGTAAT^ 

TAGrrACTTAACTAACACTTAAGGAATTAAATGGCATATCa?^^ 

TITTTATTTATTTCTAATGGTTTGCTTGGTGTAC^^ 

GTTGTCTATGAACGlTITrAGAAACGTCCTGGTOGA^^ 

GAlXXCTAGGTGAGATSGAAAGCTTCCGTCAGCTrr^^ 

Traw:erGCCAGACTCXXXX3TTGCCCGTT^^ 

AACTTAGCGGAGGGTACAGGGTCTTCTTCCAATAAGT^^ 

GCAGTCriXXXZAAGGTAGAAAGACACCreAAGG^ 

ATTTGTCTCTCAGTCAATCAGTCAAT^ 

TICCCTGCTCCX^rcTYITrAAGGTTTTCTC^^ 

ATGAATAGCAAGAAAAATAGrrcTTOGOSTGCCCX^ 

CACAGCTGACAGCXXXXXX3GGAGTCCTTCACACTCT^^ 

AGGGAGGGTTTGCTGTTQGCTTITACAARAACACTGT^^ 

CACAAACATACCTCCAGGTCAACAGTTAAAGAGAGAACCCACTT^^ 

TTCTGTTATTTATGGTAGCTAGATTTCTATTCATT^^ 

AGAAAATTGACATCATCTCTICAAATATCTGAAei^^ 

TTGGCCTGTTTCTACAA1TIT<X3TTCX:CC^ 

C(XTTCACACGTGCCTGCAGGGTATATACTTrTGC^^ 

AACGCGAGGGTTTOTAAra:riCCTTCCATTrAAAT^ 

GAGAGGTGTTAATGGACACATCATTGAGGCTGAAGCCAAACTGTC^^ 
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ATCJ^GATTAACATTATTATTATTQGTGTTACCTCATGT^^ 

ATTTTTCCTCCroriTCAAAATW 

TAAACnriTATGAAAAAGTCITCCATGTKn^ 

CTGAGAATACTAAAATCTTACACTIXXATTTTTVTAGG^ 

ATCTATTATTAAfflTACCTTTTCCTGAAACAAAATATTTGT^ 

CTAGGAACTCSTCACACACACCCACCCACCCACC^^ 

AGAAAAGTAATAGAATGfla:AATTCAAAATCACCAGAACTGAAATrRT^ 

TCGACGCTTCGCATATGTATTTATGGCTXnGTGA^ 

CAACAGGGGATACAAATGGATTTGAATTCTCTAACTC^ 

GCGTCGCCACAlCItTTCAAAAGTCCTCAGCGCTOX^^ 

GGGCATXXGTTGACTICATGTTGGGACGGGICTTAAACTAAACA^ 

GGTTTATTTCCITCTGGATTAAGAGTGATGTCGTTrTATATAA^^ 

CTATTnTrrmriTTTITITT^^ 

TrriCTTTACTAGTGAGTGGCAGATATGTTGTGGACCACAGAGAT^^ 

TICCAGCXXCXACCTrCCTATGTCTTTGAGCTTGGTAA^ 

ACGGCAGTATGTGCTCTCCAGTATTTTCATAAGAGTCA^ 

GTTGGGGCTTAAmTCnGGTAGTrcTTGCTCTT^^ 

AGATGGAGTCTTGCTCTGTCy^CCCAGGCTGAAG^ 

AAGTGATTCTCCTOZX^ICAGCCTCCOGAGTAGCTC^^ 

GGATTTCACCATCTTQGCCAGGTTCGTCGTGAACTCC^^ 

GATTACAGACGTGAGTGACCACGCCTGGCCCATAAATGTTAT^^ 

TTGTCCATTTCTACACAATGGAAACOTATAATTTTAGCCOT 

TAAGAACITITrAATTGAATGCCGATTTGTTCAGCW 

GAAGGTCXCAATCX5CACCAGAACAAGGCCAGCTTCX:TAGG^^^ 

CTTGTTTACTXnCTTCTGAGCTGATTrci^^ 

ttaatcttrtacatgagacagctatagatggatotatagtc^^ 
ggcttttgtgcaggtgactigtggcccccggcctca^ 
cccagctgtgaagattaaaagggttttgtgtgttaaaggactr^^ 
gctccgattgcttt(otgaaggcagtaagcatgrrgto 

TCGCTTAGTITCTlXXriCTACCXTtri^^ 

TCCriOTAATTTITTTTTCTTCATTTATA^^ 

TACACAGATGCATATTTAGGGTGTGCTATCCnTAGAAAGTT^^ 

CAAGGTCGTGACGGTACTACAAAGCAGGACCITTAGGACAT^ 

TGGGTTTAGTXXXTTTTATAAAACAGACCCAAGAGAGACCGTTATCC^ 

ATCTATGAGCCAGAGTGGACCXriTACCCAACACCACATCTACT^ 

GAGAAATAAAriTGITACTCATAAGCCACCTAAGTTA^ 

GATTCTTTTTTCAGTAGTTTCACCTAATAAAACAT^^ 

ACACTGAGAACATACTGTGACCATGCAGTCATTTCCCAGAT^^ 

TCrPCCCTa:ACTCCCATCX:CGGCCACAGATAGTG^ 

CTATGGGATGAGTTCCCAGGCATAGGATTGCCAGGCTQGAC^ 

TACGGTGAAGAGAGCCAAAGTCTTCATAGAAACACTCAACGCA^^ 

GATACGCACATTAGAAGGAGTAATTreACCAGCCCATC^^ 

GTTTICGAGGCTGGAAAGGa^GGTXKSAAAGCCCATCAG^^ 

GCCTGCTTTTACTAATGGTTCTGritXSAAG 

GGATGTCJVGGAriXSTCTGGTOGGAAGGCTGCTGGGACTC^^ 

GGCCCTCGCTCATTAATAATtOTTGTTATGAAGTCTAGGGTACGT^^ 

CrGTACa«:RCKX:AATGATTCACXXTCCCTAAATC 

TrcGTGGarrCYGGAAGTCCAGrrTrTCTGAGCT^^ 

TITTGCCATGGTGTTAACAAGCAAAAGAAGACAAAAGAGCTC^^ 

CACCAGGGCGAGCAGTTATGGG1<3AAGGGCTGAATTT^^ 

CTCCCTGAGCCTTGTAGAATTGAGTTGATTCTGTIT 

CAGOTSTCTTATATAACnTAATTCnCATrcCAGTGGCTT^ 

TGATATTATTrGTAGTAATTTATTTAAAGTAATTAGAGACGGGATCTI^ 

GCCTCAAGTGATCCTCCCACCTCAGCrTCCCAAAG^ 

TTTTTAATGGGGGTGTGTSGGTGGGTGGTGAATGGGGATATTCC^^ 

AGAGCTCAAAGACAAGGCnCAAGTTXrATATTCCAAC^ 

AGGTITCCTCCTTanTAAATAGAGTrcTACITAT^^ 

ATATAAATTCTGAGTAAGTATTAACCAAAGAAATGGAACCATCAAT^ 

GATGGTrACCTTTCTAAGTGTCTTTITATTIC^ 

ACACTrrAACACATAACTTGTTrTTTTm^^ 

ATGACTTGTGTGTGGGACTGGGGraxnx::ATAGGAGAAGGAG^ 

TGGGTACTGGAGATGGGGACAGTGAGTATITXSAAGCTCACAGCAGGCr 

GAGAGATTAAGTATTTTGCCATCAATCACnTIOTAGTAAGT^^ 

GCCTCTGCTCnTITCATATACAAGGGAATATGGAGGAGAW 

TICAGAAGCTQATTTGAGATTTATTCATCTTT^^ 

GGAGTnyVAAAATAAGGGTirAGGATTAAAAAAAAAAACTAGTGAAATCAAG 

GAACATTAAAGCACAACAAAAAO^AATTCriTTGAGTGT^^ 

GTTCTGGAAQCTCTOCrrCTGTTTGCACAGTAGT^^ 
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CGATCnCACTCTTCTCreTClTC^^ 

AAGACATCrACACTGGGAAATTQCTCAGTTCCTGTTT^ 

ACSOWVCACrrAATAAGTAATTGAGCAGATGAGATGCTC 

KXncraVTGAGACTTCGGATCCnT^GTCA^ 

GGCTCiWXCCTGTAAlCACGGCACTITGGGAGGCC^ 

OCAACATGGTAAAATXXOnCTCCACTi^^ 

CTGGGAGCXrTGAGGCAGGAGAATTGTITCAAGCTGGGAGCXX^^ 

GTCTGGGTGACAGAGTGAAACCXTVTCXXCXrTUA^ 

AAlTATGCICTAAATCCTKnTTGAGC^ 

ACCATCTTCATTTAAAGGCACGTATCTAAACaW^G^ 

AGTATATATAAATTCT^GGTAACXSTAGACAGCTOTAAACrGGA 

TCrCCTCron'ATTCAAATGGCTCTATAAGAGAGGAAGTAGTAC;^^ 

TCATAAATGTTTGGGGAGCTGAAGGTCyVCXXXSCTTAOT 

GCTTGTACTCTCTATCTGGGTGATGGraiTTTAGTT^ 

ATATTTATrTTTCTCCACCTTCGGTGAAAAAACTTC^ 

GGTTTCAaiy^TCACAGTCATXXrrTC^^ 

GATITATGAGAGCATGACCCTGGGGGCCTCTGGAACAGCTGC^^ 

GGCXXATAGTCXIAGCTAGGAGCTAGAGTTITAGGATGAAGAAATTAC^^ 

AGTGATTTCTGTTTTTCCTTGTTCT^^ 

AGGACTCTG'mCGTGlXXSATCACATGGGAGCTTO 

TCAAATAACATGAAGTCnGrmAGTCXXXAAACATCAC^^ 

TCCTC T ia'TTCTCTTTATAAAGTAAA 

TGTGACTAATArrAGAAGCAGACACAGCCATGATGAGCAATAC^^ 

CACTTTCircrcCCTCICCCnTAAAATGC^^ 

TATTTATITVlTTTATrTATTTGTTTGTTTGTT^ 

TGATCTCGGCTCACTGCAGCCTCIXXX^^ 

GTOnXXACCACCAGGCTCTGCTAATTTITATAT^^ 

ACTCCTGACCrcGTGATCXrACCCGCCTCAGOCTCC^ 

TTTATCirTTTAAGTAACAAAGTATTAGCAlTGT^ 

ATTITGA Ti ' iU ' i ' l AlOG Gan ' ri ' n^ ' iTri ' lUTi ' lU 'AAGAGAATGCGT^^ 

TTTTGAGGCAGGCCTrAGk5TCTGGCACa:7^^ 

GGCTCT^AGCAATCnCGTGTITCAGCCTariX^ 

ATTTTITCCAGACATAGGGCXriCACTATGlTOCCX:AAAC^^ 

TCTCTCCAAGTGCTGGGATTATAGGCATGACXXi^ 

TTTGAAATGGAGTCTIXXXrcroTCCCCCAGGCT^ 

gercatqccattctcctccctca(x:ctcct^^ 

atttttagtagagacggggtttcaccatcx:traoc^^ 

tcccaaagtgctgggattacaggcattagccaccacaccc^ 

ttgcaagaaatagttcttttggrrcttactt^^ 

gtaggggacaataacaaagaagatgccagcaatatctkrratc^^ 

tggtgggctggacagagcamiaatgaccgagaatctaccataa 

aggtitgtcccatttgtactgactctgacx:caggtig 

gctttaaaaaaaaaaaaaaagggcaaggggaggtctcagaagaaagtat^ 

atcatataagcatgatttatgatitcattggaacattgtattita 

tgacacccagtaagcagctctggcttttataattttgatgaaag^ 

a^CCAGaxnGGTCGTCAGTGAACAGATTCTTCX^^^ 

AGGGCTTCAGTTTCACCAGTAGTCGAGCTAGGGAGTAGGGGGGa^^ 

CACTCCIT^ATIGCTTCATGTGTTGTIGCCGT^ 

TTCCAAGAGGAAATGTGGTGCTCAGCCCCCC^^ 

AGTGAATCOCCAAAGGTACATGTTGAAATAQGGAAAAQGCCX^^ 

AAATGAAAATATGGGGTTCTAGCGAGGGGOVGGGAACTTAAT^^ 

AACAGGTGTCCOCCAGATTGGATCCTCTGCTCrrA^ 

CAGTCACCAGCXCCAAACCCnySTOGTCTTGC^^ 

CCAAGTC»TOXCAGGCCCCTACTA(5GAGTTCy\GG^ 

AGGGTGGCAGTGAacrTGAGGaSGGTCTGGGAGGGAGGAGOy^GCCT^^ 

GGGCAGGTGCAATGGCnCACGCCTATAATOXy^GCAOT 

AAGACCAGCCTGA<X:AACATGGTGAAACCCCATCriCT 

AATCCCTOn'ACACAGGAAGCTGAGQCAGGAGAATCGC^^ 

ATTGCACTCCAACCTOGGCi^AAAAGAGCGGGGTAAAAAAACAAC^^ 

CACACAACXriAAGGACAACTTTGAAlCAATTCnT^ 

CAGCTCACTQCCACCTCTCXXnxXTCT 

CACCATCACACCTXKXnTATTTITGTATTT^^ 

GACTTCAAGTGATGAGACrGCCTXIAOCCACCa^^ 

ATCrcTTAATGAGTTTAACT^TlTA^ 

ATAATCTATGTTQCTACATTTAATCrTTATGGTAGCCn^^ 

GGAGTGATATTGCeiGCCAGGGAATGCAaxnCAATGT^^ 

GGGGGAAAAGGTOCTACTQGCATCTAGTGGGCAAAGGCCAGGGATAGT^ 

CACCACAAAATATCAGCAGTGCCNGAGGTraX^^ 

GGACTGGTACATCn UJ ' l ' iHU ' iUTi ' ri ' n ' lUnnniUUTXlj g^^ 
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CGTAATCTCGGCICACTGCAAGTieit^^ 

aXOTAGCCAGGATGGTCTCGGTCTGCTGATCTW 
GAGCCACCACACCCGCCCTACATGGGTAT^ 

GCAAAGAATTAACAGCAGAAGATCGCTCTCACCAAAGCCXATCCAGAAAGl^^ 

TmXSTTGCCAGTAAAAACATGWSTCAGATTTCAT^ 

AAAGTAGTGGTMGTGGJVGGTAAGAGAATGTITITGC^^ 

AOXIATTTArmriCAGGCTTAATGCTriNCT^^ 

TItXXAGTTGTAATITATTAATTGAlTITOAAATTTGGra 

AAGGCCTATTCXmriTCAGTCGCTGGlTCT^^ 

NCCTCAGCCNATCriTGGNATTmiAGC^ 

a4uii ' i!Nein ' i xxrmTC L''iU 'i nonuncni ' iCuuu c' i ' int^ 

GtXGa^GAGTACAGTGGCACGATCTTGGtrcTTACT 

CCTGAGTCACTGGGACTACAGGTGCACACTACCATGCCC^ 

GTATTGCXXAGCXrKXSTCrrcAACTCC^^ 

ATGAQCTACTGTACCCGACCAAGAACTTTCT^ 

AAATtXATTCTCACATAAOTCATAGAAGTCAGCXr^ 

TATirrCTGGTTGGTGGTITOriTrcTC^^ 

TTTATTTITIGACSATGGAGTCTTGCAC^ 

CCTCriCGAGTrcAAGCGATTCTTC^^ 

TTTTirTTTATTTTTAGTAGAGATGGGGTITCAC^ 

ACX^ICGGCCTCrCAAAGTGCTGGGATTATAGGTGjmS^ 

TTAAATACCTAGGATTATAAAGGGAGCCAGCTATACaTACATATAGTTCrrCT^^ 

AAAAATAACAGGTAATGCTAATCATAAAGTrTTACAAATCTCC^ 

AAATTICCCTGACCCCTTTGCAGGCAGGAAGrK^ 

TGAACTGCACTCTCTTGAACCCCCTGCACTCCA 

TAAATGCnriTGGGCACltXXIAGGAATGAACT^ 

GAAACCCTAAAAGGAGTGTTACAGTCAGTGClCnTiM'AGTTr^ 

GAGGGTCAGCGTCACAGCCTOTTGCACCCACATCTG^ 

ATTTTAAGGATGGCAAATGTGGGGGATTTTATIXXTGATGAAAGT^^ 

AAGNCAGGAAGGTAATTTTCCCCTGGAGCTATGOCATCA^ 

TCTCTAATITCCAGCTGCTTCriCCT Cl ' iTir 

CAGGCCOXXXIATGGGTGGmTGTGGAAAAGGCAATATTTGAGTGGAAAAAC^^ 

AGTTCCAGGCTTGAGGATGGGGCCCTCACCACGGACCT^ 

TTGATACCTQCAATTCTITGTACATATCAAATTTA^^ 

TAATAATTrTTAACATTGATTGAGTGTGGATAATAATCCAGGAGCTC 

ATTTCACAACTCXATAAAGTAGlTrTCACTTAGT^ 

AATATGGAATTCGAATTICAAAGCAATCTATGGTCr^^ 

ACTTTTTGATITGGTITOGTIXrETOXSAGTTC 

GTTXrrcACAATGTACyVAATTAAGCAGAAAACAGAGGTCT^ 

TGCTGGKPSAGTCTTTCTIGAGGGACAGTGAGAGCG^^ 

ATTTCCTTmCTirrTCCTTAGGATGAGAAAGAAGGC^ 

AATTGTGCACCCIGTrTGGTTGAAACAATTGGACAGTAAG^ 

CATACCAAAAATTATCTTTTGTATTTTGNCTACXIAG^ 

TAGAANTGAGATCAAAATAAATNGGGCTOGGCGTGGTGGC^^ 

TGAATCAClTGAGGTCyW3GflGTTCGAGACTAGCX^ 

TTTTGTATTITGQGATTOXmT'AGGTGATGTGTGT^^ 

TGAACrrcCAGAGGCAGAGGTTCTAGTGACXrCAAACT^^ 

CTCAAAATAAACMACAAATAAATAAATAAAAATTGTTTAAACCT^^ 

TTTCCCACCITOXAAAAGGGTGGGAAATTTTTAAAGTAAAG^ 

TTAGAGACAGGATCTGTAGGTCAGAAGTGGTATTTGGQGG^ 

TTGGGAGTGGAACytfXriGTCAGATTCTGAATT^ 

TCTGTTATAAAATCATGTTITNTATTGAATTACAT^ 

TGTATTTTTGAA^r3GAGTTGGGTOGGTGGTOGGa 

CTGTCTCTTNACTimnTrcTGGCTT^ 

GACCAOTAGTAAGTICClGCXrrAGGCTCTCCCC^ 

ATACTAGCTAGGGTGrnCATAAGAAACAGCAGGCATAGTrACT^ 

TCTGTTTTCATGCTACTGATAAAGGCATACCCAATA 

TTCCITGrrGGCTGGGGAGGCGACACAGTGATG^ 

TGTGTAGGGAAACICIODCTrATGATACCXnCAGATTC^^ 

CTGCCTCCATGATrcAATTACCTCCCACXrn^^ 

CATKXrCAAACrATATCAAAAACO^TTTACAAAACTCTAAGCfl^^ 

GAAGCAA(aJAGTAGGAI«9QGAACANGATGCTmGGATAGGGC^^ 

AGAGGTtXTGATGACCTQQGAGGGTGGGAGCCAGACyVGAT^^ 

CXTTXiXrCMlXXCTCTCAATQGCTAAACCAACXy^ 

CAGTTGGGAACAGAGTTGAOGGAGAGGGGACAGGAGGrcACTAGAT^^ 

TGGTGGCTCATGCXrrcTAATOCCAGCACTr^^ 

GGCCACATTCX3AAGAAAAAGAATTG1CITTCGC^ 

aaaaaatctcataatgttttaagaaagtitatgaattc^^ 
caccagcx:atcgtttggacaagcttagtctacaggttci^^ 
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GATCACAAGGGACAAGTAGAAAATCCTQGGCIGCXXrrGAGAaS^ 
TCTGCACCCATITTCXrCCAGACCT 
CCCCTGATGTIKSAGGGATGAAGTGTAGCXXXrTTC 
CAIV^CTTGCTCATTGTTTXSTT^^ 

GTTGGATCCCXrAGTAACCAGTACAGTCXXrAGATACAGAGTAGTAACXXn';^ 

TCX:CAAAACTAGAGGATGAATGAGTGATa9GTAAGAGAGTTC^^ 

TGGAAAATAAATACTAGAATCArrcAAOtXXnXSAGT^^ 

TGTGGACATCCTCGATOTATTAGCTATQCATCTGT^^ 

AGGAAAACAATATCTTCCTGAGAATGTTCIX3AAGAATAT^ 

TATACeiGCTCAAGAAATATTAGGCATICITrc 

ACAACCayCTGAAAACCATTXSAGAGGCrn^ 

TCAGACXnCCACAGCATCTCATTTCTOXrm 

GAAQGTATGACAAACATTCrrGGAAATClTOGAAGGTGAAATATAGGGAAG^ 

CCTTAriCTAAATAATGACTTrOTAAAACTACATGTT^^ 

TTTTATTrTATTTTATTTTTTOTCAf^^ 

ACCrcCACCTCCCGGGTTCy\AGCX3ATTC^^ 

TGCCTGGCTAATITTITOn'ATTTTAGTAGAGACTGGGTr^^ 

TGATCICTTCGTXTKTIGCCTCCCA^^ 

AAGACTATACACTGATATCCCXXSAGCrCCXAAGriOTrGTC 

TCTTCCTCTICCAGTGTTGTCTrrTCXXri^^ 

CAGATGGACACAGGATCTGCTCAGGGAGATAAGATCAOT 



GCTGCTTTATCTrCTCTCCATCTCCTTACAGCAQCT^^ 

ATITCTGTAAAGAGAAGGCAAGAGAGGJ^TGCACnGTCC^^ 

CTTGAAGTAGAGGGAAAGGGGACGACAGAAGAAOCXrrTAGAAGTAAGAC^^ 

AATAlTITTTTCAAAAGAAGGCCTCCCTrr^^ 

CCCTGTAAICATTAATCTCGTCTTCITTCTTACAA^ 

AACACTGTTCXXXSTTGGCCATGGGATCAGGGGAAGAGAGAAGGAAG^ 

CCAGTGCTGCTCAGAGGTAT6ACCTAGAGCAAQGGGTTTAGG 

TAAITTATTCACAGTGGACCGTGAGGTTCI^GCACAAA^ 

AGGTTAGTTTCGGGTATTTCCAGCCTTAGTCTTGGCT^^ 

TTATAAATTGClTGCCATGGGGCCrrGTATACAGTAAGCATT^ 

ATACAGGCACCACAACATAAACXIACCCCCAGAGTAa^ 

CCO^CAAAATGAAGTTTTTAGTGTAAATGTCATG^ 

AAATGAAGGCATGAGGACTGTATCOnxXTGAAGGGTGTGATC^ 

CAAGATAATGTAATCTGCCXXXXSGGTCCTGTGAGGG^^ 

CCACCCTTACATATCCCAGTGTATATGGTAGCCAAGGACAGTACTXnT^ 

CAATGCAATCCCATGCAAACAAAGCTGAGCATGGGAITGT^^ 

ATGACTATCAAAAGGGAAAATGGAGGAGAATGATCACXSAAGGAGGA^ 

TCGATTATTrCTCTTTTTTAATTrcCATGT^^ 

ATTGAGTACTTAGTATGTGCTGGTOXrAGAGCCmNAGTCTr^^ 

AGTCrrCACTCTGICACCCAGACTOGATTGCN^ 

ATTATC7WXXX:TCAACX3TCCTGAGAAGCTGGGA^ 

GAGGCGGGGTTTTGCCANTGTTGTCCNAGGCTNQGTCT^^ 

AAG'TKKrTGGGATTACAGGGGTGACCCACCACACCCAGCC^^ 

a:TGTCTCCCTQCCTICCTTCCATCTGTTTGTTAT^^ 

TCTAGGTAGATAGATGAAGGGTTAGGAAGATGTAACTTCnCCTAA^ 

CACTGACTATAGATGTTCAGCCCATTACTGTGATACCT^^ 

CATCATTATCAmTWVTTTACCTCIPITXnC^^ 

TIXXXACrcGGAACCAATICATCTGTCyW^^ 

TGACAACTGTCaCGGGGGCTGGAGTTAGAGCCAGGGTCC^ 

TTGGGTGATCAQGAAGCTGGATTTGGCAGCTGGTTT^ 

CCAGAACaiATCXIAOCrANCGTCTTGGTGCX^ 

CCGCGGTGCCAGTGAGAOriGCTITrcACC^^ 

AAATGAAATOJVCCmCCCCGATTCTOW^ 

TTTTTTtXAGCATGGTATrTTGAGCGGCACA^ 

TACXriGAAAGNAAAATTTANGTGATTGNOCAGTCXyVT^^ 

ATTAANGCTANtmKACTCACACCAAAAAAAAAAAAAAAAAAAAAAAAA^ 

TCACATTATTTTXSGATOIWTIXXSGGTTG^ 

TACATTTrcACGTTTGTTTNAATATTTTAAlOTT^ 

GGTGTATGCACITCXSGCXIACGGGCTITCCC^^ 

NCTOTAGCCTGCCCTCCIClTrAAC^^ 

IGAGAGAATTCCrCAAGCKXAGTTTAlCATTATAT^^ 

ATTTTTCATAACCCTrrACTAGTAATAAATTTGATT^ 

TTTTAAAACCCTGArmTAGGAGCATTTnCimTWC^^ 

TGTCACCTGATATTGGAQGATTGTTTAITICK^^ 

CniJ'nTATATGTrTCTTeiOTlGCAATTO 

CTTTCACAGGTGTNGGGAhnGNAATGGGGAAAGAAGTC^^ 

TQGCAGCAGTCACNAATTITGrK?rTGCGT-^ 
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CXrrTTAATGATGTITrATTGAATTaTOnTITC 

GTCnrCAACTGAACTATGGTGGTTAGGTTAGTAATTiWSAT^^ 

ACTGGTAAAGGAGTCTTOTGAGGATTAATTCAAATAACriC^^ 

TTGTCATAGGAGCCATTGTOnVWXXOTAGAAAACAT;^ 

AGTGCATTTCnCTTCCAGTAAGaUVTTCT^ 

TTGOTGACAGAAGTAGGACTTCAGGTIXACATATAT^^ 

CTCTTCCACGriGKXSACAGATTGGCACATAT^^ 

TGCACCTTGCCAGAACATGTTTTCCXr^^ 

AGATAATCTAACACCAGCAATCATGCXXAGAGCATTIAT^^ 

TACTOTACCTCTATTGGGAAATTTACrCrrcT^^ 

ATTTAAATACAGGCCTTTCCCAITTCACAGT^^ 

TCTCTCTCTCriXrTCAAACACACA^ 

TATTTAGGTATAGTTAGAAAAAGCAAAAimGGTAACAATAGAATCTTtaiATQG^ 

ACAGAAAAAAAAAAAAAGGGAATCAAGAAKCOiyOKnKXT^^ 

TTAAAATCAarATCTroATTCTGAG C n'l' i U J r^ 

TTGCGTCriXXnAGTCCAGCACATTTGCT^ 

CATTGATXnaXXTTAAAATGAGAAAAGGAITAGTTTIAlT^ 

TAATGCTOATATTrcCCGTGGGTCTCTGACTCTGAAATC^^ 

GAGCTCTGAGTTTCaxnXKSATTTCTATAGGCT^^ 

TCTCATTtnGTIGOCCCGGCTGCAGTGCNAGTG^ 

CTTCCCACCTCAQCCTGAGTAGATCGGACri^^ 

GGTAGAGCCICACrrrcCTGCCTAAGCTGATCT 

CTGGGACCAATAGAGATTICIXrrcAGAATrAGGTCC^^ 

GTXKSGTAGATACAATTAAmTrAGAATTTTTAAAAGGCTTIT^ 

ACATTTATTTTWAAACGTGAAAAGa^TTTAGAAAATTGC^^ 

GTACGGCCGGGCGCGGTCXXrrcACGOTXnTV^TC^ 

TCGAGACa^TCCXXXXrrAAAACGGTGAAACCrCGTCrr^ 

CTGTAGQCCCAACTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAW 

CCGCCACTGTACrxXAGarnXGCGACAGA 

TACACAGACAAATGTTCTGACAGAAATACACTGAGATAT^ 

TGTTTATAGTTAAGTTAATGAACTTGGCCAGACAGACCia^ 

TGACATTCAAAAAGTGATITAGCCTTTGTTCCCXn^ 

CCCAGGGTTATATCTCy^TGGGTrAGGCACAGACCTAC^^ 

GTTTCTGGTTTTCTGATTTlTClCTCXri^^ 

ATTTGTTTTGTTTGTAAGAATCAGTTGAGCyVC^^ 

AGAGGGTTTCnTICAACAGAGTATCCCACAAAAAAAAAAAAAAAAAAA 

TAGCSACCTGGACTTAATCTTCATGATGlTIXn'AAA 

ACATCTCAGGATTTGAAATGTTAAGCTAACyWVTTGTTC^^ 

GATGAACCTACGTAATGATCTCCTGGTXXXATTTICC^^ 

GACATGATGTACCCTCTGTTCTGCAAGCNNT^^ 

AAAGACAGAGTGGTGTTQCAGGGACCTGCAGTCAGNrc 

TGTTrTTCTTrTTTAGTAGTTAAAGCTGAGAGCAAAC^^ 

AGCCGATCTGCAGAOGTGAOCXiTGTTCGCTCGAATTGAAC^ 

TGCXSTAATGCACCCAGCAGAGTTGTGCTCAACTCnCATT^ 

CTATCAGCXrCrCTAAACTWSGCATAACCrnSAA 

CCAAGTGCTTTOOTTCCrTrcTnCT^ 

GGCXnXSGGGGTTGTCACAATGTGATCCCTCAGGATClT^ 

GAGGAGGAGAGTAGAAI*3CAGAATAGCAAACAATrTTT^^ 

GGGATGTGGTTGGGGAAGQGCATAGTACACATATTTGAGCAGAGTACAT^^ 

AGTGTGGGCTGGCATGTTCTTCTGCACAGTCCCT^ 

TGGCACAGATGGCCTTCTTGATCTCCAGTrAGGGCGAGT^^ 

amTCCTCrrTGGCTQCCTACTGGCTAOGCCAGl^^ 

TAGAAGAAAACCACyVOGTCTGTGCACACTATGGAT^^ 

AGGTACTGAATITOTOGTTAATAAAGTCAAACTTGTC^^ 

ATGCJ^CITACACAGAAGTITGGGCXXn'AACATGGAAACT^^ 

GGCATACXyVTATGCTCAAAGACTTCGAATCCAAAAA^ 

GGGAGGOVTCAAGACTATAAGATCCJywy^CC^ 

AGAAACCTGTX3AGATCXXTIGAATATACTCa«:AATACA^ 

GCCTGATAGGTCCCCTGTCrrCCTGCTTAT^ 

TCAATAAACTAACATCnx:AACAGCCKXX:^ 

AGAATTTCTAGATAGACTTACATCCCTCAGTTTCAAAGC^^ 

CTAGCCAAGCAGAAACCXT^TGTTTTTGACTTI^^ 

CATAATCTCTTAATATTCAGTGAATAAAAAATTGAGTrTr^ 

ATATTCTCTCTGACTATAATTCTAAAACAGTAGATGT^^ 

AACTCCCTAAATAATACAATCATAGTAACAATTTATTGAGAGATi 

CTAAAAanTT-ArrAAGlAGACATAATCTTCATCTTATAAA^^ 

CTGAGTTAAATCAATCCCAAAiGTCACAGAGCT^^ 

GCCATTTTTAAACTCATTGCAATGCACAGTGGGTTAAAAT^^ 

ACACCTCTAATTCCAGCACTlTXyvGAGGATAAC^^ 
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CTATCTCTACAAAAAATTTAAAAATTAGCraxyiATC^ 
GGGGGATCACTTCAGCCXy^tKAAGTO 

GAGTGAGACTAmxrrcAAAAAAGAAAAAATCAGATTATGGCa^^ 

GAGAGAGAGAGAAAGAAAGAGAGAGAGATTATTTCTAGAATmAGCAAAC^^ 

TAGTGATKriClAAGTACATACCCACATTCnX^ 

ACTGGTACCACTGTCXXXywSGGAGTGACTGCAAGGGT^^ 

GCCCCAACICAGGGATCmxn'ATTGGTTCTAATCT^^ 

(XTCGTQGGATATATTGAGATGGTTCCnTrTAATCCAT^ 

rrATGTTGCXCAGGTKXXXriCAAAGTCCTGGGAT^^ 

GTGCCTQCAACCATGTCTGGCTTAAATCCATCCTrAAACAAGTI^^ 

GAGACCNACAGTGATGCCCAGGAATAGTCATGGITGGCGATTAAAOT 

TGrTCAlTKXrrATGGCTAACATGACAAAATACC^CCX:^^ 

TCXSAGC3CTQGAAATTCAAGACAAAG^mXX:TGGa^^ 

CTTQCTCItnCCTCACGTCaXCT^^ 

CCAGTCXrrCTIGGATrAGGGCACCACGCTTATGACAT^ 

CAGTCACy^TGGAGGGTTAGAACTTCAGGATATAAATTO^^ 

CACAAGGWVTOITATGGTCGATAAGGGGAAAATQGTCCCTAOCC^ 

TAAAGAAATTCyvCATAATTAATCACAAAAAGCTAATACTATQACAAAGAA 

GACTCTAACOTCTGTAGGAGCTAGAGAAGGCTTCTCT^ 

GGGGTO^GGCAGGAGAAGGCTTGGGACCAGGAGGTAGAGAAAACA^ 

TACAACAGGTITCGGGACITOAAAGGCAGCCCACATTGAGT^ 

ACTGGACCCAGAGTTrCATCACTTTGTTAATTACATC^ 

TTACGATTCCATIOTITATATTGTTTGTTGAAAT^ 

TATTTTTAAAATATAAATIXXTAGTCCavCTCCCTAAAG^ 

GTGTTrAAGAAGTOCATCAGGTTCTTCTGGTACTGAlT^^ 

AGTTACTCWETGTIJITACCAGAGAGAGCCTAGAAAW 

AGGOCCGGTQGCTCATGCCCXSTAATCTTAGCACT^^ 

TCAGTCnGGCrAACATOGTGAAACCCTACCrrcTAl«^ 

GGTCCCAGCTACTTQGGAGGCTGAGGTTCXSAC^^ 

CACTCXIACaCCAGCXrroGGAGAGTCnX3AGAC^^ 

GTCTGCaVGTITrcAACCCACGGGGATAGGGTGAGATGAACAT^ 

TITGTCTACATCAGCTCITICAAATTAATACAAT^^ 

AAAOGAGGGACAGAGAGGTTAGATGTGTATCTAAGGTCACATAGCT^^ 

TTrcAATGACCCTGTCGTGAAGTCTGATTGCTG^ 

GTGATTCTACTTCrrcATTCATTAGGCCATCCAGAA 

AAACTAGGTGGCTTATAAACACCKWU^TCrrATTTC^^ 

GATTGAGTGTTTGGCAACGGCCCACTITCTGGTTCCT^ 

AAAGGGTCTXriCTATCTGQCCTCmTAAGCCCTO 

CCAGCACTTTGGGAGGCTGAGATGGGCAGATCAC^^ 

CGTCTTTTCTAAAAACATITrAAAAATOAGCCGGGTGTCGr^ 

CAGGAGAATCACTTGAACCCaGGAGGTGGAGGTTGC^^ 

QCAAGACTGTCraWVAAAAAAAAAAAAAAAAAAAAAATTCAACA 

AGTA(n<5ATGTO5ATATGATTTGAGCTAAGACTCT^^ 

T'TAATTGCATTCAACAGGCATGTATTrTATTAA aU 'i U 'i CT 

atgtgactcacgcagagcagaato:gaattitctaagtg^ 
tcxxttcaatttagggggagacattitrttaagtaa 

CTGCCTCATTCSGATGV^lCATATrrTTAACTI^^ 

GTTGACTCTAAATGATTTIXXXnTICATGCAGTA 

TATTOAATTAATITTGGAGAlXjAAATrATITrGCXCAA^ 

GGTTTGTTGGTITGAAAATAAACGCCTTCATTTC^ 

ANCCTCATTAACATCAAGGCITAATTGTAAGTTAATGTGC^^ 

GTrrra^XCCCCAAATATAGCOCCAAACTO^ 

TTAAANCATCATArKriTTrATCGGTGCCTIX^TAATAAA 

ATAAATCTCAGCTCTCTAGAGTXnCACTO^^ 

CGGTCGTAAAGGAGGGACAATAGAAOriCrGTAGAATCAAAGGC^^ 

GCCACGAATAGATTCATTCaTCACX?rCTC^^ 

TGAATCTGCTGTAAGCTCTGTTAATATATITGTTC^^ 

TATATTATICATCGTAATGTAATrCCaXXnTTAAT^^ 

ACrnGGATOSACCATTAATCATCX^ 

GCAGCAAGACATGGGGCCCACTCXAGACCAATTGAACGAAAAGCTC^^ 

GCrAATGGAAATCATTCTGGGCAAGGAAACAAAACTGAACAACATTT^^ 

ATICreAACXSGCrTAATCCTACAGTTrAGATAT^^ 

ATACXinCTAACAAACATTTXn'AlGGTGTATGTAGGa 

TTAGCCAGATAAAATAAACAAGGACAGAGTTGACTGAGAAAAAAGAAA^^ 

AAAATTCCCATCCCACCXXCGCAACAATGAAGGGAAAT^^ 

TGGTrGAATGAAGTTACCTGGTAGGGATITAATCOTrATT^ 

TCTTTAACATTCTCTTCTT^CTAAAC^^ 

ATTT'ATACTCAGAAGAACAAAAGCAACCCAATTATITGAGAGA^ 
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GCAAACTTTGCAAAGGGGTCACAATGATAAATATITITCTG^ 
. ACAGGTGGTAGACTGGATTTAGCTTGTGGGTTGTT^Gm^^ 



TCTTAertXTCGGCTGTCXriTGGGGATGGCAG 
TGATACGCTTTICCATCCTTGAATAACTTCGGAAW 

CGGACCTAATirrTTTrTTAAGGATTAAGAAAAAAGGAGTAAGAGAGAT^ 

CATTAATTXyvTAAATXXACTGTAAGGAATGCATGACACTGT^^ 

CAGTCAATATATTCAGTGAAACAAATACXrrAATATATTCCATG^ 

CATGAATATATTtTItTIGAAACAAATATCXnG^ 

ATGAATTTATTCCATGAAACAAATATCGrGAATATATTC^ 

AGAAACCTAATGAAAACAGCATGCTCCTATATCTITrT^ 

AGCAITCTCTCAATITICCCAAGAAAGGAGG^ 

TTGACXXIAGTGTTCTAAGGAAATGGGAGAATATrcCC^^ 

TTGGAATGTTGCAGGCaGGCTCXXSCTAAGGTGaT^^ 

GGATTCTGTCTCOTAGCTGGTTGCTCTT^^ 

TGCCACTCX:(XCCGCCCX:ACCI<3CCCXrGCXXC^ 

GCCTCCCA™3AAGCATCXrKa^AAAAAGCAGA^OTK^^ 

AGGGACATCATTGCTCTICArrTTACGClCGAGATC^^ 

GTGCCrAAGCANGGTGT^03TTCTGOTTATC^X^^^ 

AGGGAGACTTTGAGACCTCXSGGAGGGCTCTCKX^^ 



GCCTGATGACCXyVGGGACCCAGGCAGTCCTrACJ^^ 

ACTTGATTAACCCCATTTCCCCTCAAAAGATCATACTIOT 

ATCTGATTTACACAAAGCXATCTCXrTTC^ 

CICCATAGCATCTCAATCTTAATGAATAAAAGCACTG^ 

TACAGTTTTGTAATCXIAGCTCCATGCTGGATATTrACC 

GTGAGGGAGCGATGAGTAGAGGAGO^TGGAGJTCGTCCAGGA 

CCGCAITCAGAACGAGAGCTCJ^TACGGCATCCACGCGTGT^^ 

TCTGNCATCICTGTTAGTTCTTTCTCCTCXOT 

CTCATTCOTTITATTCanTCTnrcATrcATT^ 

TTTATTCTTATTXmSGTATGGTAAGGCCAAGATG^^^ 

TCCATGGGGACTNCCACNATGCCACAGGGS«3Tr^^ 

GAGAACTGTCAAGA(OTTna^CXriXX3AA^ 

CX5TGTX:ACTTGTAAGTGGGAGCIGAAa^TGa^AGAAC^ 

GAGGAGTGGQGTTGGGGGAQGGAGAGCATCAGGAAGAATAGCTAATGGAT^^ 

TAGGTGCAGCAAACCACCATGGCACAGGTTTACCTATGTAACAAACCT^ 

TmrrTAAAAAATGTTITrATTGTGATTTTCA^ 

cttgggtcatitccatgagctctgggacacaggggttc 

gtgggtgrtggcctggagttctiagcccaataaagccag^ 

gtggttaccagtcagctgtcractatctctaagaatrc 

ggcaataataagtgagcnagtatggcgnccctaatagaagcsagg^ 

catgatagagaagttttccctagaaggtctgtgagggaggaggcc^^ 

tcaccgaggtgccaatgagtctqgccxrrccaggtcct^ 

agggaagtgtggctgcccggagagctrcgctgaggt^^ 

agagagaaaagtgtagaaaggtoggcmytcatacagcafiagggct^^ 

agggcatcaaacacgccgtcaagca<xk5ag<xaacaaatct^ 

gtaatoxstaacaggctaaaccatgtggcacaxttccc^^ 

tggcctgccttaaggcagtagcacntagagcaggga^^ 

gggaagagaagaactctcaagattttcactatcagtqgtataa^ 

acactaagtatgtgggaattatttatatcctaccgctcaaagc^ 

caacctgaggcataaatgggttitaagctcx:ataccagtaaaactt;^^ 

agaaaggatagatatagatacawiwtacacatatokxmcagtaatagat^ 

tattgtatgatacatatccatgcatatacgtgggtgtacl\^^ 

aaaaaaatccx:caggacagtctcattoxxrmtr^^ 

ttccattcagggcacgousattrggtocc^^ 

AGAAGCXXACTGCCCACCCAGAGCCCercA^^ 

TGGAGACAATCXATTCXrriTCACAAAGAeiOT 

ACi^GCACTGAGTAAGAAGAAAAGGCSCXrCX^GATAAACX::^^ 

AAGGTCaXSAGGAATGTGGAGACATKSGTOXXXXXi^^ 

TCTCTTTCTCTCTGTCTTTTTCTCr^^ 

TAGTITCTCAGAATCCTCATTGGAGQCATGTGTATAT^^ 

AGCATTTTTCAACCAAACTGAAAGQGAAAGGTACTLVl"^ 

TKTDtflAlTTGGCACAGTTATTGTTGAATAC^^ 

GCTITAGGTAAATTACTTATTCTCTITGCACCTCA^ 

GGGGCTICICTCAGTATTCATCAAGGTAATATATGCA^^ 

TmTnrrACTACTTGCATTATTGTTC 

AGGGACAATGTITCACCCOTTGGACTGGAAGTAAGAAGCCCC^^ 
AGGAGTGCAAAAGAAGCCTCTTGTTTGACTTTGCXX 



AATTTCAAAAATCAr 

tgaattttcaittt: 




kTTTICATGGTTTTAGAACATCATATTITCrACTATGG^ 
rKAAAGTGOTGCTTTGGGAGTTCTTCTTAAC^ 



['ATTUVCACTTOTTTCTCATGCATTACCTG^ 
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AATGftGAATGCCAGCTTAAATAAATGTTATCACWVTCTCTGC^^ 

CAQGCn'ATrrATT'ATTCTGTGATAATAATAAATCXXrrXS^ 

GGAAGTAACAGGGAGAAGJ^CTGTGAGAGATaSAGGTCCAGOCW 

TGAAGATTCATTCATTGTTCCAlSWXrrACCTAGAG^^ 

GTCAAlTAGGCCXTXXTTATAATriTCTGCTTTAACAGC^^ 

AATAAACAGGGTACTTTCXCrrcCTCCTCTAAAAAAAAAA 

GATTtXSGAACGCTIXXXriTGCTQGCTCnxrAGG^ 

AGAGGAGCAGAGTTCTTCTGGTXrTCAGAGCACGGGGA^^ 

TGAGATGATTCATITATCTTCyvGTGCX:ATOT 

CAACCAAACGTACCGGTXnOTGATTTTGCACAA 
TTTATTTTATnTTATTTTTCGAGATGGACTTT^ 
GCAAACTACCCXCGCXXXXGGGTTaWXIAAT^^ 
. CAAGCCOCSGCTAATTITTCTATTrTC^UVTAGAC^ 
AGGTO^TICACCCACCTCAGCCTCCCAAAGTGC^^ 
AAACACATTCCAATAAAATAAAAAGAAAAGGGAITCTCCTAAGTGa^ 
AGAAGTCCACATTGCTCCATrrcGACTTCCT^ 
AGCTTGCTAACACAGGGAAAGACCAATTATCATGAT^^ 
CATAGACATTIXXSAAGCAAAGAGAAATGCTITrCATG^ 
CXXXXSCACCGGGCTCTTGGGAATATCTCTTAGCCCT^ 
ATTCTAGKnxmCGGGATACATCCAAAAACAAGCAGGAAAACC^^ 
CXATGGAGCCCGTTTCACITACACGTTTTTC^^ 
ACCCTCTATCACAGCCTTTGCTAAAACCACTCA^ 
ATTCCTCTAGGGAAGGATirriTmTITrTT^ 
GTGGCGCGATGTCAGCTCACTCXyWVCCT^^ 
ATTATAGTCACCTACCACCATACCCAGCTAAITIT^ 
CTCTCGAACTCCTGACCTCAGGTGATCrGCCC^^ 
CTGCTGCTTTTTCAGAGCCTGGAACAGTGCCAAGCACAT^ 
AGATCTGGGGCTGTGATTCTGAAGGCCCCGOCr^ 
GCCCTGGCTCX^KX:CACCACCGOCTTCCC^^ 
AGTAAGAAAATrPCTAGTCXXrrC3GA(7rCTCX:ATC^ 
GATGTCTTIX3GTTTGGCCTGCTGAAGTTTGIT?TT© 
AGACATCCCATCAAAATTGGGATTTCCTGCTTATC^^ 
CTACAGTTGGCTGGACTTCAGCAGCGGCTGaXCT^ 
GGCCrrGCTTCACTCATTGTAGAAATGGCCTGGCC^^ 
GCTTACACCTAGCTGTTGTGTTGGTTCTCATAGGICICC^^ 
GTCACTTCTCXrCAGTCTrGGCriTCATCAA^ 
TGCCTTTTCCTTCACCAAACCAAGTGGCTCT^^ 
GTCTGAGTlX3GGTACCTTCX:ATCCIXX"inUOTTCCCATAC^^ 
ATTCTCTCXriTO:TATTCCAGGCTGTCAGCT^^ 
ACAAGATCTGGCCGCTACGTGGAAGGACACATAGTTO:^^ 
AAGGGTCAGCTIGGAGATAGAGCTGACCnC^ 
CACAGAAAGCACTGGGAGCGTTTATGAATGTTGTGGT^ 
(XTCrrGGCTTCAGGGAAGAACGATGGACT^ 

AGAICATGGAGGCCAGAGCCAGGGATGGATATAGACTAAGACAT^^ 

ATTCTAGCATGAGACTGGGAGAAAGAGATGTTCIC^ 

CAACXATTGOCTCXSATCACCTTGTTAATATGGTTrCT^^ 

ttaaagcaacactattagcaaatcxxx:actggcttcat^^ 

TTTTC iUU ' lUVlU ' rn 'AATCCTCCrrCCIX^^ 

GACTTTGATCGlinGCAlGTATIXX^^ 

GCACAAAACAACAGGCTCCACCCTCCTOXCXrCTCT^ 

TGCAGAAAG(XTAGCTCCCCAGCTICrcCTGA^ 

GCTtjTTATClUWl'lCAGAGACCITAGTGACAT^ 

TmCTAGGCCATTTTTTItnTTCCC^^ 

AGTAGGGCTTrTCTCCJ«X;TAGTAGCTCAC^ 

AAOCTGTITCGATCTGAAATCTrTCTT^^ 

CTGTAiriTACTOm'AGAATAGAATATTATTAaiATC?^^ 

TtXX:AGCACTTTGGC3NAGGCCGAC5GTGGGTGGA 

AANCCCCCGTC^CTACTAAAAATACNAAAAAmTAGCCXSGGNNTOG^^ 

GAGGCTGAGGCAGGAGAATCCCTrcAACCrcmSAGGTT^^ 

CTAGGCAACAAGAGTGAAACTX^TCTCKyW^AAAAATT 

ATCATTVTOAAACCAGrnnTAAGCAATTAAAAAAAAGCAGCTAl^^ 

TGRAGGCAraSTCATWRTGAATGCXmAGAAATGAAa^^ 

AGAATTITTTrTrcnTTTGCTATOXrrGl^ 

CTACAATTCrrCTAGATCTiGAAGCrTAGAC^^ 

GTCCXXTTOGAGGAACATCCTnCCTGCAT^^ 

AGATCCTTGAACnTGTAAGACTAGGGCCCXnXOTTCCT 

CX:ACATTCCTOyV(nTTTQG<XCCCroCAT^^ 

TTTGAATIXriTTCTGACTrTlCm^ 

AGlTCCCTCCIXKOTrGAATCTCTCT^ 
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GATAGTTTCTITATCTTAAGGGCAACIGATffKKS^^ 

TATCAATTGACTCAGTAACTCXSAAGAATGCATGC^ 

TATCAAAGCCXCATTITTTTTTGAAaW^GJ^^ 

GTTTAATATAAAAGAAAGGGGCnV^TTGAAGAAGATATAAACTGCAAAGA 

ATCAGAOCmTTATTACAGGCAGACACGGCTTCACCTGTGAGCC^ 

TTCCITCTATGGCTCAGGATCrrTCAGTTACCAGC^ 

AACTCATTATCTKnCTCmXXriTAACGTCTCT 

TTTGCACCAACCNTAACAATNGGGITGACATTCNAGC^ 

TTAGCCCCCIGGAGGCATATGCTGGGC^CTCAACAAACAT^^ 

AATTGGATQCrrCAACTCrrcGArrTGATCC^^ 

TTTQGGCACAGGAGNAAGGNAAATAAGAGCXTCGCGAGGTIT^^ 

TCCTANCTtTIGAGAQCTCACTTTATCTC 

ACTATACGATCTACntSTmCGOTCXrr^^ 

TCAQCAT?UXaGAGTTAAQQGAGGCAAGCAAQGCX:CTTAACCTC 

CCTTGACa^CTTGAGGACTCCATAAATCncroCCC^^ 

TCACTTAClCCaVTTCXyVGAGGCXnXOTT^^ 

TTTGAGGCCATCTGCACTCACTGACAATCT^ 

CTTTAACTCaGQCSAAGACCCTGTCTATGGAAGriXX^ 

GTACGGAGTCATCACATCrCATAGCTGAGCGACGTCGGAGCTCa^^ 

AAAGAGAACTTGAGCAAACICTTACTCTGCCA 

TTTTAATCariXraXTATCTTTCTTTCT^^ 

acagcatacatactaatcaggtttgaaagttgtctrgt^ 
aaggatgtctxx:aatcaagatgattacatgggaaaataaa^ 

TTGACAATACTTnXTTICAGTCTATTTrc 

tatttatttctittct^tactctgaatatcact^ 
ctgacaggtattgaagccaaaaattgaacaaraac^ 

GCACTCGrroTQQGTGaACACACaCAiCAC^ 

AATAGAACGTAAAGGTAGATTTTTAGAACa^AAOXXXn^ 

AATAAACTATAITAAATTATCCAGGAGGAATGGrrAGTGATCTGATT^ 

ATTTGTOICCTQGITGGCAGATATTAAGTCCACAGATT^^ 

ATTCTTCCCTGTCCOCTGCATATGTACATCCTTGC^^ 

TCCACrcCTTGAACCAGGATGGCATGAAACITG^ 

AACGTGCTTTTATGTTO(?KXnTGCCCACTTGC^^ 

CCTTCTGGGCGATGAGAGACACAAGACCATGTTTra 

GCTGACAGCAGACACCTGCXX:AGCrciAGCXX^ 

TGTTTATAGOTACTAATATITGTGATGGTTCATTCXTiCAC^^ 

TCGGGAGCTGTTGTAAAGAAGTAAAATGTGTAGTCnTroCCCTI^^ 

GTTATTTCGTCAGTGAAOCCTCGCTGATGCCCTG^ 

ATATGCTACAGGGATATTCAAGAAATTOAAGATCCTTGAAAATC 

TGGGAGGGGTAGAGAGTGAGTGTrAACACACAGAGAACCCAGGAGAa:riTr^^ 

TTTCTCCTAAGCAATCAATTTCTnTrAAATC^^ 

AGCTCAAAAATAAATTTCIX3GOU:CXnTC^^ 

ATTrcACCTATTTCCTTATTTGTTTAAAAATTTT^^ 

TATTTTTTCTTTTCTTTCCTCTTTT^^ 

TTCTATITAlXXATGCa^TCTGAAAaCCTCCTTC^ 

ACTACCCAATTCTTGGAGCCXrrAGGACCCGAGAGAA^ 

ACTGGAAGGAACCTX:AGTTIt?rcCATTCATTCAT^ 

TGGTGCCTCTAAGTTltSAAGTCTAACTtXZAT^^ 

ACCAAAAACGTACTGGCnTrAAACAACACATGCTrATCATGTIX^^ 

CCTGGTCJAGATgrCAGTAGGCXriTGTTC^^ 

GAGGCXriCCTGCATTCCTTOGCTCTTGGC 

TGATTCCTCTTCCGrrGTCCCATCACATTCICCTCAT^^ 

TTGGGAATACCCAGATAATCOTQCSATAATCACTCaviT^^ 

GCTGATGACAGAGTCACAAATTTCGGGAATTAGTGT^ 

CTAATTCTGGACOTCACGTTCCXrrcATCCATAGAGCAG 

TAAGCrrrATCCAGGGGGCaCTCAGTGATGGGAGTCTTC 

TGTGGGTAAGACCATCACAlTITrCATCCCXXXX^^ 

TCmTTGTCriGCXTGGATGGCTTrAACATAATAC^ 

TAGAGGCTGGGAAGTCCAAGGTGAAGCCATGGGCACAITCT^^ 

GCCTTTGTGTGTCCTCTGCTTGGCAGACiU^GAGG^ 

ACCCCGTCCCXXATGATCTCAATIGAAATTAGTC^ 

GGTTCAGGCATATGATTTTGGGGGAACTTGGGGAGAGGAACACA 

CmTACTAAGTGATTCAGAAAATAGCAAGTATAGTAGT^^ 

TrciGAGTTCrnXCAAAAACCMbnTriVlMl^C 

AATTCTGAGGCATCCATGTGAGGCACAGGATGGTrcCAA^^ 

AAGTGCTCAGAGGCNAGCCGGAAGCIGTGGTITITACCTC 

CTCriCAGGACAGGGCTOVCAACa^^ 

CTAGAATTCCTQCCAGAAAGGTCXSAGATGTGCAATAAACrmSAC^^ 

TKnTIXSATTCATCAATCAGAGTGGGGCATGGCTGTC^ 

CTCTGTGAATAGTATTAGTAATAGCAGCAGTATCGATA^^ 
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tcatagcagaaactcacatgtgtgtgctitctatctgcc^^ 

caccctaagtgataggtgtattcatcattcicagtat^ 

caaqcccaaccctggtgaaqggcgtgatksgcagaagc^ 

gagck3cckx:cagggtcttr^^ 

cx:agcactttgggaggccgaggtgggtcgatcacgagg^ 

CCa«m:TCTACTAAAAATACNAAAAAAAAAAAAAAinTAGCCCGGCCnT^ 
CGGAQGClXSAGACAGGANGAATGTCAItWWXC^^ 

AAAGGTia:CAACXriCG(XQCAGCX:TGCTATrGTATTrA^ 

ACTGGCTtnrcCCCACTGGCTGTGGAGAGT^ 

AGTTTTTCIXXrCGCAACTGGGCCACCAAA^^ 

agtgacagagtgtotcaacxxrrxxiagc^^ 

gtggctcaccggggctttgcgaggtttaitaatcttagc^ 

okhtcactactccgcgaatrxsaaggcggcaggagtct^^ 

gaagaacgccgtcgtaaaaatccxstgcggaatitgaaagtgtgc^ 

actttcaccaaaaatcatcctctgaaaaact^ 

tttagaaaatatatgagotacgggatgaagatgagggaggaaaatcxnt^^ 

cctcx:atgcgggaagaaaaat(xaccc7unxx:aagaaccgtggc^ 

ccaagaaattgctaaccactcxttttccaagitatatattaaa 

gcctcctggattcaack:gagtttcctgccttagcx:^ 

ATTirrCTATrTrAGTAGAGGTCGGGOTTCGCXATGTT^ 

GCCTCGGCCTCCCAAAGTGCTGGGATTGCA£X5CATGA^ 

GGCTGlTAGCTGTCACAATATTAATACTAGTinOTX:^^ 

TATCAGTAGATGCATCTAATAATCCTAACACTGTCATCCTO 

TTAACAGATGAGTAACCTACTCTGGATTTACCCAGATAGTGC^ 

GTGTCACCCTCTTGGGCTGGCTCATTACAGGCCACT^^ 

TGTTTCCX:ACTGAGAAGATTCAAGAATAAGAAAAGTGCTrcC^^ 

TTTCATTGAATCAAGGCAGTTTTCCTTTATAAAAT^^ 

TCGTTTAAAATATCACTCymTGTTGyurAlT^ 

CX^TTTAAGGAGCAGTTGGAGAAAAGGGGGAAAAAGTTAAATGTTAATGAT^^ 

CTTTTTGTACATATATGATTATGAAGCAATTTCCATC^^ 

AGAATCGCTCTACTTCTCAATACTACACTGATTAATCAC^^ 

AGCAGGAAAATTAGCTVCTrCCTGCAATAACGArACTAATITCA^ 

ATCTTGAACCTTTTATCCTCriTCTTAT^^ 

atgatctsacctcatcgacaggctgctcxntctaqctcatc 
acatggtcxtkritagagcttcctctti^ 

tggttatcggaaagattaaaaeictaaaactaacaggtggatataaatataaagaaaa 

ttcatataacttcttoattcgtaaaagggagacaggtgagaatacaggaaa 

aatttaactttcx:aaacaaaeicaggggaaactagtg;^^ 

tgttitattaatataaaaacttgcgtataaatgtccagttt^ 

ctaaikxaacaatttatttttctitgagacac^ 

ctgggctgaagcgatcctcatgccttagtctcxc^^ 

ttttaatttttcttttatagagatgcagtatcg^ 

cgcctcixk:ctctcaaagtacagggattataggtgtgaqcc^^ 

ctagaagccaggcgtttctcaaqcritctgaagacacaaa 

cagtgggggaqccagccaaaaaatcagtaaacaccaagataaaatatattc^ 

aaacmtggcactgggctggaaatgaatggcaggggagaaggcgggcggggc^^ 

ACAGGGTTTCAAAGGTGAGAAGGAGCTGGCAGTGTGAATAGCCAGGGGAAAAC^^ 

AAACGGTGCCCTGGCAAGGCAAAGATTATTCATTGCTGAAGA 

ATmTAAATTTAGAATTGAGGTTOAAGGGAAGTAITQCTO 

ttacagtagaagagaaatctacatttctgaaaatcx:actg^ 

cagacagcctttgctttatgtatcaatttttgta^ 

aggtaaatgacttgcxxaaggtcatacacccagtacactt^^ 

cattaaatgtgtatatatatatatagrnmxattacagcttaacagac^ 

tattaataagtttltctxxxx:ataggtcttgttatgc^^ 

aaactgtcctataaagaataccatattatcgaaaggatgttcaaactcat^^ 

agtggctcacacctgtratctccgcactt^^ 

tggccaacattotgaaacccctictcractaaaaata 

ACTCGGGAGGGTGAAGCAGGAGAATCGCIXXSAACCCGGGRGACA^ 

GCCTGGGCAACAGAGCTAGATTCCATCTCAAAAAAAGGCATT^ 

CAGATTGGCAAAACGTTAAGTACTAAGGTCATAACAAGCACTGGAAG^ 

gaactgaaacagoiattttggaaaataatitggtattacgtaataaa^ 

ataaagatgccttagagaaactcmxsagaactaggaaaaqctcagg^ 

cctgaaagtcctcx:aaaagccgtctacaatgaaatggggaaattaa 

taatgggtggataccagaacgtqcatcaacacagaagaatct^ 

gggcacagtggcxcacacctgtaatgcxlagcacritgg^^ 

tcaggggtctcaggtactkx3gagacnx»ggagggaq^ 
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TGTACCACTGCJ^CTCCAGCCTGGGCnAC^^ ttaCATAGG 

AGAATATATaCAGTATGATICAaTTTATAGAAAGCITAAAGTATACX^^ 

ATAAAGAAAaCCAAGGGAATGAAGAATGCAAAATATGAGCTAGTCnTT^^ 

GGAAAACAGAATQGTGCCATGGCGAGAGATACACAGQGGAGGTATTG^ 

CCTAGGTAGTCATTCTAlATGTrATTAAAGTGTACACACATTGCaTATC 

TAATCTGAAAGAATCTGCTrCCrGTAGTAAAAATTAAAAAAArr^ 

CACTTTGGGACSGCCAAGCXy^XXXSGATCAC^ 

CCAAAAATACAAAAAATTAGCCAGCX:ATGCnXXXXX3GGTGCC^ 

GGa3TCAACCCAGGAGGOU3AGCTTCXA<m3AG^^ 

CATCTCAAAACAAACAAACAAANCAAAACAAAAAAGGTTrAAGAAATQ^ 

ATACTTGGCIGTTTTGlTTITrCTTCT;^^ 

TACAOCCTGTGATTTGAATTTTCCATACTCCATT^ 

ATCACTCCTCTGCTOVAAAACATTCAG^ 

TTGCCCAGTGTACTCrCTGTCTCACr^ 

AAGATGGGTAGCAGGAATXXTGCACTCAACXIATAATCrAAAGC^^ 

CATCrKXTTCATTCTOSATCTCCACTl^^ 

ATTAACACAANATTTTGCAGCTTAAAACAGCACACATTTAT^^ 

GACTTAGCTGCSGlXXriCrGCCrAGGGimnTCACAA;^^ 

TTCy^CTGGGGAAGAATiaXTICCCAAATCA^ 

CXSCTTOCTTGClCACIXnTCGCTGTTC^^ 

GGCTTTTTCCACAAAGCAGCTTCCTTCAT^ 

GTNCTTACA^SCCCAATGTAATCATCXSAAGI^CATCT 

GTCAGAAGTCCTCXrACACATrcAAAGGGACXS^ 

ACCTTACAGTCTATCAGOCACAACTTGAAGTATCT^^ 

CTGAATCCTATCTAAAGTTGTCGGT<X5TTGCTTCC^^ 

ATCTGCCTTGAAAGACTACTACCTGrfTCTTCnX^^ 

GTATATGTGAACCACATTTTTAAAAATGCArrTCTGAACC^^ 

AGACAAAGACATATCATCTGACTAACAATGTGGAATCTGAA^^ 

GTGGTTGCTQGGGCTAGGGC9GTCAGTCy^TGGGGAGGT^^ 

TICTGGAGATCTAATGTACAGCAIGAGTGGTGATGGATGT^^ 

TAGTAATCAAATCAri7\CTTTATAGACCCTGAATATATTC^ 

AAATATATATATATATATATATATATAAATATATATATATATATATATATATATATATATATATATATATATAT^ 

ATATATATAGATATATATAGATATATGTATATATACATATATATATGTATATAACTTTGCAT^^ 

AATCCTCCTTX:?^TC3GAATGAGAAATCATCri^^ 

CTGACrriTrATTITriTICAAAGTTGTT^ 

GAACGTATTTGTTTAGAGGATACTAGCCTCAAATAGCCerATC^^ 

CnTACACCTACCCATTAGTCTATAAAAClCCTGAGGGATGCTX^ 

GGCACAAATCGTTTGATTICACCCCATCCTAAGATCT^^ 

CATGCCCTCTCriTAAGTCGTGCATTTXiACGCATAT^ 

TGAAAGCCTAGCCTGAATAATGGATGACACCCTCCTAAAGCCACGAIT^ 

AATTTCATTAAATCACAACTCTATTTACTrTTAACCCAAC^ 

AGATGGAAAATGCCTAGATAAATACTGCATGGGACATrnTAAACATGTAAAA^ 

TTTACCTCTTTATATAAAACATGCTATATTAAAAGAGAGATTAAATm 

ATTTTTTTCATTTTTCAGAGCATGTACTGGA 

TTTCTrGAAGCACAATATAATATTGAATIGGGAATGTTGTCI^ 

tgtatgaatttttaaaagaaacttcmtatgctatgcatttgaa;^^ 

tnaaataggcttccggcgcaqgtggtgtatgtttcaccaac^^ 

tcacaaaatgtcxagctttcttictntat^^ 

ggtiragcatagagttatgatgtcxsgttctcttto 

tccitcattcttcatcaggctgatgt^r^^ 

CCACATAAAAATGClGCTITGAGOCCTGGTACei^ 

CTTGAGGCCAGAAGTTTGTGACXVVGCCTGGGC^^ 

TGATGGCTCATAGGTGTAGTCCCAGCTACICAAGAGGCTAAAG^ 

TGAa:TATGArrcTGCCACTCCACTICAGCCTA» 

ATCTTAATCCrTTCTCAAGATTariTCTAAG^ 

ATCTTGACCATriTATTTTTCATCXXATC^ 

TICAGTGATTAGTTrATAGGTAGCCailAITrCACCCAT^^ 

GAACATTCTTCTCTTGGTGTAAATTAGTCACCAGTGra^ 

TTGGCTAAAGTCAGTTACATAACCAAGCCCAGCTITAGAAGGGAAATTOT 

AATGAATATTTGGTTCACCAGACTTTTT^^ 

CCTIGTTCAGATTTTTAGaOTTC^^ 

CATGAGTAAAGGGTCCCCTGAOIACGAGCAAATCATTT^ 

AAGTTCATGCAGCCXrrGGGTATGATQCCCTAATTTTGAGC^ 

TAGCTTTGGGACGATGAACAATTTACTrATGCTTTC^ 

TACATmy^TCATTTTNCACCTrcTCMAGACCT^^ 

CACTCTATTTGACCAOVTCTTCCTAGCAGTCT^^ 

GACATGACTTTCAGTTTAGCACATGTRCAGATX^ 

TTTGACAGAATATATTCAAGGGTGG Tin CTIX K rrAT^ 

CACTAATATTrTO^CAAGAACATTTGTCACCr 

AAACTITATAAAATATTTGGGGAACATCmCTANGCCACCTQC^^ 
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ANCAAGANCACTATGNCACTAATTAAGCTGNTTC 

CACTX3GATTAGTCTCTTGCCTTTCAATTTTAT^^ 

TATATATATATATATATATATATTTTITITITITmTITr^^ 

GGCACTATAGGCa:CCACCACCATGCXnXXXTAATTTO 

TGCnCTCAATCTCCTGGCXrTCGTGATTCACC^^ 

CTGGCGAAAATTATATATTATACACACAAACATACACACACGC^ 

GTTATGCACnx?ITrcAAATAAATTTATTGOCfl^^ 

GTAAATGGNAAATGAATACAACCGCATGGTITTCrrN^^ 

CATTTCXS^ACATGTGTTACATGCTACAGAACATC^ 

mATTANCCTGTAATGCTGTnSTAGCTTTGriTGAT^ 

CCACC^INACCANCAAAAGANGGCAGTAACATmAGTATO^ 

GTAGGTrmCTITCTTTGTTTTITCTTXT^^ 

ttctagttatrgtgggtgttaaaaatcactttaatgat^ 

ctaactaccagggatgaaaaggcaggttttcxntrcttato^ 

agtagagtgtcacagcatatgtcgacggtcngacncatat^^ 

gcgaagagatc^ctgagccaaa^7rcaagggtaaa^k:tgagt^^ 

aggatcatgggaggggaggagcgtgagttcctcrrraagtgaatataaacca 

ggcgagagagaaggatcxrraggtgaggtgctxttogtgcgccagagac^^ 

cx:aggaaactcctgtacttcaaggttttggagtcx^^ 

TACCGACGGAGAAGTGGGATAAGAACTATGTGTCnTITCACT^ 

CTAGGTAGTTGTGAGAAATGAGGAGATAATAraTTAAATCTCCTAC^ 

TCTCTCTTCTTCTGAAATGAGGCTGCCTC^^ 

ATCGACTTCTGTrCTGAATGTTAAAATGCAGGTGGCTGTC^ 

CTCTTCriTGTGCACATTCCTGCi U n'iClV:^^ 

AACCTCCACCntrCAAGTTCAAGTGATTCTCXrK^^ 

ccggctagcttitgtattttragtagagatcgggt^^ 

ttcxiaccttotcggcctcc^ 

caatgctgagcaccaattacagattcatoxx:tc^^ 

aaaaacaaggactcctggcagatgcccagaggcatitam'aagcac^^ 

ggccmgaacrw\aagacacagaaaggatttctttaot<ct 

TITCGTATTAGTAAATTATAGAACCAACTTTTGTAACACTATCT^^ 

TCAGCTCCACGAATACTGCAGTGGTGAGAAAATCGCGACAAANGATTA^ 

ATrTAATAATGrCCATGTATGATNGGCCTTCATTAGAATG?^TTC^^ 

CCTTCTAATTTTCriTTGGTAGTGTCTGCCC^ 

GATATCTGTGAACCCrrcGCAAAGAGTAATCTATTGAAAAGGAA 

TITAATCATATGAAAATCAGCCATCAGCCTTATTAGAGTOG^^ 

TCTGTTCTTAATCATTGTrGTATATCCTTAATGCG^ 

TCTGATCTTATAAAGGAAAAAGTTACATATGGTCCTTTGAAC^^ 

GGGTCTTGAGCATGTGGCTATTGTATTCATTGGGCTGGAAGC^^ 

TCAAAATTGAATHXTOTACCCTTTTTXrATATGAAT^ 

AGGAACTGGlTTTATITrAATrGAGTTTTAAAGCACn^ 

GTGACTGCCAACGIXXrrAGTCTTGTCGTCTT;^^ 

TCTTTTTTTTAACCAGTAAATGCACTGAAACATGGAATC^^ 

GCTACCaVTAGATTGAAGTAGTTTATACCAGTACATTTITrAQ^ 

ATCTCriTGTAACACCAGGTTCCCTACCTTr^^ 

AGTTGATTTCATATCCTTTTATTTCATAAACnTI^^ 

CTCGAGATGACCTTAAAGTATCTGACCCXXAAATCTAATC^^ 

AACTTOGGGCATCAAAACATTGTTTTAAATATTTCGTTO 

AGGAACGCTGGGTGCAGAAGCATCTriGGTTTCrcT^^ 

AAATGTACAGTTATATCTTCAGCACAAGTGAATTATTCAT^^ 

ATGGCCAAAGTXnXSTCAGGTGAAGGTAAAAATCAAGTAAGATCT 

GCTTGTCATCTOAGGlTWnGTTXWa*ATAGATAAAAATGT^ 

TATTTCTTACAAAGTGGGAGTATCAATTTTTOTAA^ 

TGACXSTTCArrcGCTTGCCTW:^ 

AGACGTTATGTGCAATCCTTGTCmCTAArrAT^ 

ggttgattggtccaacotagcck:gatacctctatc^^ 
gagtaaacttgtotatccatagggttc^nttagaaaagtcttcat^^ 

TCAGGGGTCCTTATTCCACCCTTIGATTCCAAAAT^ 

CTAGTGATGTACCGGGAACCTCCTATGOjAACCAGAGCTCT^ 

TGTTCTTOGGGGC^IATCACAGTCTCATGACAGAGTGCA 

GCAAGAGACCTAGGAATTCACAGAGTGAAGATGATTTCrrC^ 

TTAGGAGAACnXSAATCTTITGTTTOXSAAGCC^^ 

TTAAAGATGTGTX3TTCCTGAAACTCAGGAGGCTGCTCT 

AAGGAACAACTGATTCTCTTQGCCCTCCTTAG^ 

ATTCTGACAGTACTCAGATTTGCTTCATCTAC^ 

GTATCTTCAAAGGAANTTTAGAATCATGNTCrmAATGGCT^ 

CATCCCAGGTCATAAAATGTAATGTOSGATTGTAGGTmGTGGGA^ 

CTIGAAATGTCAGG(7IXXrCX:ATTCATCnTCCC^^ 
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CAATOrcAA0CaVTCAGGCAGCGGGTIO3GGGGGT^^ 

CTAGAGGGTTnWGAGTTAACnXXOTlTCACATG^ 

CTXSAANACAGTOZACTGCAGATGTAATTGAGCC^^ 

AAACACTTCi^GAAGGAAAATGCTGTTGCAGAAATGr^^ 

CTITCTTTTTGTGATCACAGTGTOTAATTATAAAGT^ 

ATCCATGAGATCTAGGACCACGTCTGTCTraATCAri^^ 

CATO\CX:ATCACCATAGCTAC7rATTTATIXXXATATA^ 

CCCAACCATGGAGTCATGAGTCAAAGATGGTCAAATOKX:C^ 

ATCCATAACCCCAAGATGTGACTTICCAGGCTTGGGA 

GAGTAAAACTGJUACAGTAATTCCCATGTtXrrGA^ 

TAACGGGAGCCCATTATAAGTGAACTGAGCnCTIATTrTAATATATACCT^^ 
AGTGAAAGAAGACAGACTGTTACCAGAGGACATATTCTGTTCTA^ 
ArmCCATGTCSAGAGAAGTCAGAATAGCAGTrAGC^ 
GGTATCXTKXSAAATITTCCATTCATCl^^ 

AATATTTGTGCACGTCGGTGCGTGTAAGCAATATCrcAAAACAT^^ 

CCOSCCTCGGTCAAAGCACACAGTGQGTrTTC^^ 

CGAGATCftCCGCTTCAarrAATTCCCTAriX:^^^ 

ATOmTTACItrcAAAANCCTGAACCNTAAAAANGAAATI^^ 

AGATGTACAAGGCTGNrGGTTCTGAOaUXXXXX3C5^^ 

ATGTAATCTGCAAACnCAGCSCTGAATGTTGATCTAT^ 

GGTAGGATCTTAGGTCGAATGGAGACCATCACCCTGGCT^ 

TTTCGGGATCGGCCACTTACTTGTGQGGTTCACTC^ 

CCCCAACTCAACTITCACTGGCCAAAGTACTT^^ 

GTCOTiroriCCTCGCTTCCACCITATAGT^^ 

TCGCAGCACAATCCGTTTGTTGTTGCTX5TTGT^^ 

TTTTTCnTTTTTTGCCTAArrTGTGCT^ 

a3GGGCAGATTCCCGTATCATGGTGCT^KX:TTGQCCTGATT^ 
TATTGOCITGTTCKriCTTCXAA^ 

TAATCCXAGCAGTOTKXSAGGCCGAGACrcGGOyVTCCC^^ 

AGATGTCATCTCTACCAAAAAACAACAACAACAAACAAACAGACAAAC^ 

TAATCCCAACACTTTGGGACXXrrcAGGTGGGAGGATTGTrT^ 

GACOrTGTTICTrAAGAAAAAAGAAACATTTAATTAAAAATTCAG^ 

TGCATTTCAGCCTCXXSTGACAGAGCAAGATTCTGCC^^ 

ACTCTAGCn'AA<XTGAGGCTAGACAAACCTCTCCATGTC^^ 

TCriTCTCCTTTAGTTTTGTTTTTCTAC^^ 

ATGCTGTTTTACTCATKTIXrrAATTGAAT^^ 

ATGGTGACTCAGATTAAGAAACTTGCTCAATGTAACtMGTC^ 

GC(n<?rit5AATGTGTAGGATrTTGCTAAATCTTI^^ 

(XATTATTCriTTGACATAGAGGAATlTO^ 

GGTCTATAATITCAGACTCAGGTTTGTTTAAGTCC^^ 

TATCCX:CAATGGAAAAATCX:criCICCCACTTTTCrr^ 

AAACCTCTACTAATTCATGAaXyATCm'ATTTGATA?^^ 

G<X:TCACX:CTGTrGCCCAGGCTGGAGTCC:7Wa^^ 

TATCCTCTTACCTCAGCCTCTTCAGTGGCTG<^^ 

ITlTCTTTTTCn'AGAAACAGAGTCIC^ 

CCTCCCACAGTATTGGGATTACAGGTGTC3AGCX:ACTGCGT^^ 

TCAGAGAAAGCTACCAAAGGGAGCATAACICIX^ 

TATCATCATCCCATTCATAATTGATTCCAAACGTGCAT^ 

AGCGATOVCrroGrrGTCCTGAGATTCXCTCAC^^ 

CACTTTCTTTCXnGTTGGATCCATAGGAC^ 

CTCTCriTGATCTGTTATATGATCCTTAGGC^^ 

CITGCTGTCAGGftAACAlTAAGTITO^TAAGTGA^ 

(XTCCCCATTQGAAATGATGGCAGTGGTTACAAGGGAC^^ 

AATGTGACCTCXTW^TTCACOTCAGCArrATG^ 

TAACCCTTXXXrnCCTGCTGGCGGG(XriTG^ 

GAAGTTICCCCTQGGTGGCTCCATTGCrrCCAGGTGAA^ 

CAGGTTCTTGCACnTGGCAAACAGAAGGAGACCAGAC^ 

TTTAAimCCCAGCCTGATTrATTGAGGCTQCCTTTGC^^ 

TCTGTACTCAGATGAGGCTTGCTCTCAGGAAGT^ 

TTTTGTAGTGGCCTCACAAGTTTCriTGTAGC^^ 

GCroXSAGGACCAGTrAGTCTATmro?^ 

ITOTrGTTGAGTOITCTrTCCTGCTGA^ 

ATTGACGTAGCATCACTTTATTGAGCAGCTACTATGT^^ 

CAGAGTGGTCTCTCnay^GAGGTGCAGGTTTAC^ 

GAAGAAGAGGAC3GAGGAGGAACTICTCAGGAAGAAGaMXX7^T^ 

CXIACnomXMAGCCTCTTGCTGGGAATGTGGCCAT^ 

CCTTATTAAGAOCITATCAGTrACCTGTAGATTCTC^^ 

CTCi^GTTTTGGCAGGGTrGAGTCTAGTATTTCreT^^ 



SUBSTITUTE SHEET (RULE 26) RO/AU 



wo 01/44466 



PCT/AUOO/01539 



49/75 



i^mTCTTATTGCOGTAAGTGCTCAGGAGGGAACAGAGGGAGAAGGAT^ 

GCACAAATGAAGAATATTTAGTCXIATTCTCn'ATTAGTCX^ 

ATCTAGTTGTCATITIATTGACAGTCATITCTACC^^ 

TITCACTAGCTTTATGTGGeKX3CAGTTC^ 

AAGAGTa3ACCTTCX2«:AGAACCTGCAAamr^^ 

CMTAGAGAAGTXSGTTOCTTAAGCAAGTAGTTIXXrGOT 

GATTGCATCTAAAAATAAAGACTTCnXSAAGGTAAAACTOn'flGCTr^^ 

TTAACTATGATGCTACTTCTTAGCTGCAAGGGGGGA^^ 

GGCATTTCTTGX3AATTTCXX:AAGAACATTTCAT^ 

CAACAAATTTTCnGTTtXTTCTTAC^ 

ctotcxttaaggagagaaacacatataggataggaaaatagctigccat^^ 

TCTCTCTCTCTCtmnCTCTGTTTATO^ 

gqctggaaaattcaagatcaagatgcagaca(xk:tt^ 
acgtoxrtgtgtccttatttggtwcxwvgcaagtgtgt^^ 
aagacagatcxjwsaiotiogcwatggtcaagaag^ 
gctcactgam'aaataaataggattitocxrcagtitjggggaagnag^ 

TTKSTGTGTTAANGGa^AGATCGAAATGTCnTNNGTAAC^ 

NAGAGNAGTAAGTAmTNGAGNAAG<X:ANC^^ 

CAGTGCCAGGTCTAATGGGCTTCATITCCAAGACCT^ 

CTGGGATGGTTGCANGAGGflGTTGTTAAGAGCTCA^ 

GCATTTCTAAATCrrGAGaTKriaili^^ 

GGTACACGTACAAGGCGCTCAACTCyu:OGTGGCTCa^^ 

GTCAGGTGGTGAGAGGTGACATCX:CATGATGTGGTTC^^ 

CCXNCCTrKrrcCCTTTTTCAAGATGGTT^ 

QCACCITCriCITCCTl^TATGTACCT^^ 

tctccttgacx::agaactgtatcaaat^ 
atggnaagagtaacagccxiacn^acacaactgtc^^ 

ATCCCCKnrCariCTCATGTATCTCTArrAC^ 

CAGATAACAATCOTAACATCAGGCAAATCXXnTIT^ 

rrCXrrTCAGTTCTCTGAGTCTCAGCT^ 

AGAAAAGACrcAGATAA^rcTAAAGTACCTGATTGAGTAACNTG^ 

NNGGGTGGCCTGGGGTrATTCrrAGTOCNAGTCXj 

TACTCAGTGCAGCCTCTGTGAGCTrACCACCACT^^^ 

ATGCGCTCAAGCGACCATAACCflOCAGTGTTCCTAACAATXXrGC^^ 

TTATTI^TCCCTCAAGGTAAGCATCGATTTTACCTT^ 

ANTTCACCNTCANGCTGfrcAGamaiAATATa:^^ 

AGAANCAGCATOSGTACAAAAAAGGTCTGCTAGGCAAAGAAGCT^^ 

TTTGTTIGTTrTTACTCTCnCACTIT^ 

GTGGAATTTTGGCAAGGCTAGCATCATAACAAATAAACATT^^ 

TCAGAAGTTCCTGTATTAAATTACAACOVTAAGAACATACACAAQCAT^^ 

TCCATAGGAax:AGAGTTATTCCTCAAGCCCX:TC^ 

ATCAQGAGGCAAANGAGAAGGAGGAAGAAGCCTCAGTGGGGTTG CU ' lUi ^ 

AGGtCCTGGCATATTTTGGCAGGCCIGGATCA^^ 

GACCCAAACATGGNTGGTAAArrCTIXXy^TAAAAC^ 

NCmx:NACCGm:ATCGaaATGATCOQGGr«:TOT 

ATAACTCXXrNTGCCCAGAATl^ACCAGGatfXrAT^^ 

GAGCTTTTCNTAAGAAGCCXrCACCCAACACTACTGCT^ 

GCCTtfSGGAGGCTOSAAAGTACAGCCTTCAGTCC^^ 

bKSAAATGACAAOSAATATTTGGGANCCCACTOGGAGTT^^ 

CTCOXrrGGNAGmCTATCXrrcATTX?n^^ 

ATTATATTCCCCAGACTGGTCTTAAACrcCIGGCC^^ 

AGGCTTGAACCACCATCCCTGGCCriGATftCTGATA^^ 

ccgtcaccx:ttgtactatcccattrtccm^ 

tcx:taccagccagaggtgggcacatgacccaggtca^ 

ggaatgagcatatatcaaacavggacagtx:x:aagtcct^ 

CATAAtnrCACCACACACAGCTAAGCTGGTC^ 

CXrrGTTGTAGTTAAAAAAAAAAAGAGAGAGAGAGAGAGAGACTGATACCCAAA^ 

AAIKIAAAGATAGAGCTAGCATGATTGCAAGCTCflGGACATAC^ 

GCTGTAGACAAACAAAAATGATCTTTGGGTTCCTI^^ 

TGTCCCACTATCATTATATCCAACTTCTGTAAT^ 

ATATCGTCa\GGCGGAGATGTCy^TGTGATCACGr^ 

CTTCTCAACTCXyiCCTOirrc^ 

rmTAACCCXIATATACGAAGCAAACCrATGTGTATGTGT^ 

AAAGTCXATTCCTGACATTTCCGTAGA CTU l Tl ' iTriUV^ ^ 

TATTCTTGCATTTGCTGCAGCAAAACCTCACC^^ 

<XCXrCCCNCCCACCAAATTKnXXX:GTTGTCrr^^ 

CTACXn'ACAAATCCCACATGAAAAGTTCTCAATC^^ 

TTCACCnCCTTICCTCGTTGTTTCCAGAAGGGGATG^ 
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ACCTCAGGGTTTCTCATOGACCACACCnTIXrr^^ 

TITAAGTTTTTATTTAimTrAWrTTAGNCAmTt^^ 

CAAlTTCATrnCATAGlTICTCKmT^^ 

AATTATAATTAAGAGAACCAGAAGGGCCTQTCrrCr^^ 

TGAAGACTIWXXSGAAAAGQGACATGGCCXmGGGAACCAGftGTGGT^ 

ACCCXAGGGTCGAAGTTGCCATGCGCATGAGGCTGTGTT^^ 

AGa\ACCTCTGAGGATGGGTAGW2ATATACTrCATTG^ 

TCCKXXrAGCAGGTTTAGCAGrrcGTCXXSAC^ 

TCCTTTTICCTGAATATCGAACCAAAGCeK^ 

CrGACAGa:ACAAGACrCTCXXnCC<^^ 

CATCTICTTTCXIAACTAACATACTCACT^ 

TCCTCAATCTCTCCCTCCCXy^<XACAGTTAGCT^^ 

CX:CCCCAGCTGACCAGTGTC3GATGTCTrGAT^ 

GGCACSAAACCTGCACTCAGIT^TCTATACATGGTGTAAT^ 

TTrATTATTTGTGATGAGAAGATICAAAACCCGCTT^^ 

TCACCOTIXrrcTGCAACAGGGCACCAGAAT^ 

AAAOrreAACTCAGGAGAGTCGAAAAGAACTrc^ 

AATGGTACXSmAGATGAGAAGTTmTACTCnT^ 

ACTATATATGTATTATACATCGTAGGACTTATTTTTITAATGTG^ 

AGTGATATCAACAAAACAAAACATCATATCXAAAGTCACCC^ 

myrrCATTTGTTTCATGCAATTVGAAACGT^^ 

TtkrnnXIAAGGTCTTGGCCCTTCAAG^^ 

GGTGGGATGACACAGGCTGCTGTCXXAADX^ 

AGGTGACTTCCAAGATTGAGGTTOGCGAff 

TGTGCAAGGACCAGCCy^GCTGA£Xn'AGAGGAAACCTA£3ACC^^ 

ACACXX;ACTC3TGTCAriX3GAAGCX:AOCNAGGGmGA^ 

CXXrCTTTCATTAAGTGAACGAAGGGTGCAlXXrrcAGT^ 

GCTNGCCCTCCCATCTITCTACmXSTGAT^^ 

ANTCAGCCAGCTCTCCAAAGCAAGCOCCACACTCTCT^^ 

TTCTAGCTATGCAGATGCAGlTrAAAGACAAATC^^ 

GACCTATKXrrTTGTTTATTTTAGCCGGTGCC^^ 

AGTCCCCAGGAGCT(mxXX3AACm:CATAGGTGCrGTrrc 

GrrcAATCAATTAAACGGATACTATTAGCT^XTAC^ 

TCATTTGGATICrACGGTGGCTCrTTACCXATT^ 

GTTGCACCATTTTCTGAGCAATGTCTITCGTCC^^ 

TCXTTTTCAGAAACAATACTGGTGACCTGriTITAAGAAAA^ 

TTAACXntTVTTAAGTTrAAAATTGTTITTTTCT^ 

QCCTCAGTCATCTTCTATAACATAAGTTGATAGTCAATAATT^^ 

CTATATATATATATATATAAAATATATTICCATGAATATACTCAATQCrrAAN^^ 

TTAAATTCAGAAANTTTCTAGTTGTGTTTATmXrCCT^ 

TGTAAAANCTACTCAGTOIATGTTTGGGTCATo-l-l'l'lOT 



TTAAGCATGAAGATTTTAGTCAATGTGCTnCAAAGACCA^ 

NAATCTATAATAGGAGGAACTGATTTTCyW^ra^TATO 

TATTTCACTATCaGAATTTTCTAGAAjCXTWnTAA^ 

TtnGTAAGACTGTCTGTCACGTGGGAQGATTTCAAAT^^ 

AGAGGGGAATGCi^TTGAATTTATAAGGTATTATTAACTTCT^^ 

ITCATTTTGCTCTGATACAATTCAGCAAATAGCXAGAT^ 

ACCAGGCAGGCTCTCAGCmX5GTGCCATa3G^^ 

CCAACGTTITGAAAAGAAATCTCCTTrCAAAAATC^^ 

TSGCTTTTGTGCTTGGTGAGACCTTATCT^^ 

ACTCIXXCTGGCACTGTTCTAAACACATGACACC^ 

AGGAAGTTTCAGTGATGCACTTAGGCAACCATT'AAGT^^ 

ITGAACT^TCAAACTGGACTITGGGGCTCACAGTT^^ 

TCAAGCTGCTCTCAGCTAAACTCCAGGGG^ 

ATGTAAATCACCTCCAAGAATCGGTCTGCGTCTT^ 

TTTGTCTCnX3AACCTCX:GGGATCrrcTCAGGGQC^^ 

TCSGGCTTGAGACTCTGCAAAGTGCAITAAATGGGAAGGAAAC^^ 

CTCntSGOCATGGCTGGGGAATTQGCCTWGlCAGT^^ 

NCCXXIAITOGATNGGCCXntiGGTCNATCTGTAGAC^ 

TT^CCCACTTAATTCriTXXrTCACA^ 

GTCTCAGACXX3GCCCCTGAAGAGACAGGTTC3GAC^ 

CAACAGAGGGAGAATTGTrACCCX:CGGGAGACTOOGGGT^^ 

AAACCTCTGGGNATTCGGTGTGGTGTCCATGTTGAGGAA^ 

ATTTCTAGGACTCAAAGATGCCXSTGGGGGTGTATTAGTCTC 

ATTTATQCAGGAAAGAGGCTTAGIXXSACTCACAGCTC^^ 

AGGAQTCAAAGTCATATCTTACl^TGGTTGGCAGCAGGCAAA^ 

ATCCGATC>riCCnK3AGACTTGTTCACTATCAC^ 
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CCXX^QCCCTCXXACAACACATGGGAAT^^ 

ATCiVCAGGGTCACCTGTAGGTTACCXrrAAATAAGAG 

AGATGACX3GAAACAGCC0CCXy^(X:TCXMCCATTTC^ 



AGATGAQGGOWXSATACCACnTCTCCTAGGGATCAT^ 
TTGTCACTGCACAGCTACCXrrCTKX^CA^ 

ACTGCCCTAGAACGACAGCAAGACAATAGGGCCATGAAAGCATAACXX^ 

TGACTTClCrGTACAGCCATCTTtXSGAGriW^^ 

AGGTGACTCACXSCCTAGAATarrcT^^ 

OGGCATtlTCTAGCTTTGGAGGTACCTXSCX:^^ 

AAAGCATOXX^KXrACCCTCJUCCTCGTTa^ 

GTCAGGCXXriTCAGGACTCAGGGTGTATAAGTGACAT^^ 

CTCTCTOCCACTCATATAGAACTCCTITATCACCA^ 

AGAAGGAAGCCGGGTCCCTCTTlGCCCCTTiaKmST^^ 

GAGACAGATTCTGGAGAGACACICAGGCTTAGCTCTAGAAA^^ 

GCCTTCCnTGACTGGTGCATGGGAATTCOCX:^^^ 

AGAATCTATTATCICCAAGCCANTTATTCACACA^ 

AGCACTTIXXOTTAAAGGGGAAGAAATAGCTTri«:CCGTGGTGTA 

GAATACCATAATAAGAATGTGGTAGTATATTAATTGATGGAAATrGTAG^^ 

TCTCTGGAAGGAACAAAGGATTAAGTTAGGGAGflCACSTAATGGTAlTGT^ 

AAGAGTAGGGAAAGGTGACArrCCAACTGGGAATATTCGAC^K^^ 

ACnXK:CGAGCTQGGTTTTGCCTCATCATCCAATlU^^^ 

GTGCAGTCncrcTCTAACAATAAATTATAGAGTCAGGC^^ 

AAAATGCNAGACTTCCTTAlTCGrmiKrClTC^ 

CACGCTTTTNGAACTAATCANCACTAGTitfSAATC^^ 

CNAGTATTTICOCATOOTATTTTATTOT 

ATCTGACAG3«X:AGACTGGAGTQCAGTGGTGT?W^TCTC^^ 

TGCCTCAOXrrOCTGAGTAGCrcGGNACTAAAGGCAOT 

ACGGGGTTTC7W7^ATAT^NGGCCAGGe^GATC^mXW\^^ 

CTGGGATTACAGGTGCTAGCCACX:ACACCX:GGCCTCn7IC^ 

GGTAGCATCATCCCTTTGCATGCAAAAGATAAACTGAAATCT^^ 



AlXXTOGGCACTGCACAAAATGGTTTACmTGGTTTATCTTGTA^ 

ATTCTCTCXIATITrACAGATAAGAAAATTSAGa 

GCTCTGTCTTCTGCmACACTTTTTACT^ 

CGTTCTTAGATTCAACAAAATGTAATACTCXACCITOX:^^ 

TGGAGTACTAACTGCAATCAGATGTTrrAirrcTTTATTTATT^^ 

GCAAGGCTCTGTGCTATGCTGTTTTATGGACATCJ^T^^ 

TrmiATGGCACAGATAAGGCATTGGATGCTC^ 

GGGCXTKnCCTCTCACXIAGCATCAACTGC^^ 

ATCAGGCAAACTACATGCATmAAAAAACTATATCC^^ 

NAGGOTGAGGAOGACXSGATTACTTGAGGCCAGGAGaTC^^ 

AAATAAAAAATTAGCTGGGCGTGGTGGCAGCCAairiOT 

A\CCCAGGAGGAGGAGGTTGCAGTCAACTAAGATrcTGC^^ 

AATAA<n'AA^TAAATATAAAATAAAATAAAAAATATGTCCTGrrAATGAAATT^ 

TIGGAACAAGTGGGGTCGGGACAGACAAGTGACTGAOCACl^^ 

AGAAGCnSGCTGACATAGTGGCTCACACCTGTAATCCCAGC^^ 

GAGTICAAGATCAGCCTGGGTAACATAGGGAAACCCT^^ 

cacacctatagltxcagctactaaggaggcngaggcggtagga 

tgatcacgccactcaattctagcctgggtgaccgagacct^^ 

ttttgaagacacaaapicagatqcatgtggatgggcxxaaaatac^^ 

cacccttcx:ttattttctotacgacaccaat^^ 

tcatgaatgcmagtolgocagatgttactct^ 

aagcacggtgctcagtgatgctgtgagggtgagcaa^ 

AAACTTTCACCATTCTTTTGTAAAAATTAGAAAAATACA 
AGTAU"l*ll,TinTAGGATrCTATTATTrACCcri'iViCTrC^ 

CTCCCTCCCTCCCT G ICTT m 
CTTCCCITTTTCCCTCXXrnnC^ 
CCCTCAGTTGGCAAACTTAAAAGTTGTTAAGTCTG^ 
CTAGGAATGCCTGATGTATCTGTATCCTGTGAGTTTGCC^ 

ITATATAAT(^TATTrAGTGATTTGACTICITAAGTCC^ 

TCATCTCAAGGGAGGGAAOnCGCTCTTCTACTC^ 

TCCCTTGGCCCTTCTCAAAAGGGTAGCCAGTTm^^ 

TAAAAAAAATCAACCAATCCCTGCCCTTICTAGA(X^ 

CTCAGCTATCCAGAATTACAACTCCGCXriCTCXntS^^ 

ATC3AAAATAGCa«3TTATCITTGaXTTCATCXi^ 

TTAAATTGAACCTTTCCCCTGGTTAAGTGGAGT^ 

ATTAATATGCAAGTirAAATACCTCTIXXATGGAGCAAGAAGC^^ 



TGAAGTCATCrcCATTTTAAGTA': 




ITGGCnTAAAGCATCCATGTTTACAGAATGAA 




kTTTTrcTTTAACGATCCATTTACTGT 
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TTGAATGCXXAGCarrCXSCTGGAGTAGTACTGC^ 

ATTCTGCTAOGAGGGATCXXXriXXXAAATTAAAGACAC^^ 

TCCACAACCAATCATCATQGa'XTC G in'AACACAGTCAAA^ 

TAAAAACACATACTTCTTTTCAACAAAAACTGGCT^^ 

TTAGCACATTTAAAATTTTACATCATTITAAAATTGTTO 

TACAAAGGAAACGGTTACTTiaSATTTATIUrTn^^ 

aagagaagaaagaaagaaaaaaaaaagggtccttgatcccctgctc 

tgagcccctctgaacaaaagcaaatcx:tgttgacx:t^^ 

aggagaaaagaatqcnoxsaaaatcagtgacacgaagttgaatat^^ 

ttaggaasactxsaaaggcxtvtrtcxxrcagaat^ ^ 

ttcagaaagccacacxxtcitctgaaaacaggcaaaaat^ 

gaatgtaactgagtacxrtcaacacttcagto^^ 

GTTCACATCGCTGATTAATTATGTXXyVATGATCATTTT^ 
TCTKXXXX:ACAACCiam3AACTGAATAGAGT^^ 
AGTOXTAGAAATCXSftATGATTCAGATTGGATGAAA^ 
AAACACAATAAATACACAGTTAATAAAATACTTCATATmATTATAAAAACA 
TTITCAAATACAATATrrAATAAACTATATTAAAAATATATACAAAATAGT^ 
AAATCAAATACXyriTACATXilATTTACTTCTC r ^ 
TCAGCTTATCTTIXXATATATTTGTCCAATGATAAT^^ 
ATTTTTATTGGTGCrAAGGGCTGTAATTTAGCAAAGTOT 
TGTXOTAGACCCGGftGAflGTCTCCAAAGAATGAC^ 
TCTTTTTTTTTTTTITriTGGCT<^^ 
GTGTGTTTGTTTTOfflCAAGGGCAGAGTGT^^ 
AGCGTGTCTTITCGTOOCACGGAAAAGGGCTGGAT^^ 
GCTTGTCmSGACCTGCACACATGTTTGAGACC^^ 
TAAAATCTATCAAAGTATAATTAAATACCTACATWyCATAAl^ 
ArrAGATTTCAGTGATATIGAATGTGGGAGATGGAACXATITr 
(XTGAGAGTGCCTGAAACCCAGAGAGGTCXSTAC^^ 
ACCATGCATGAGACAGAACAAAGAAAAGAACTGTGAAAATAC^ 
COTAAAAGGCCAGTTICICTACTGCAGAATAATGAGGTCT 
AACCAGArrGCAAGTCTGTTTGAATCCCCCA^ 
aVGAGTOGAAGTTGCCTTATAAAGCTGCGACriTGGCCAC^^ 
TGTTGGGAQGTTCAAGAGCCACTCTGTGCAGAAATGCCa^^ 
CCCATACTOTGGAAGGGTCAAGTGCAGGGTCACCAACC^ 
' AGGGAACAGGCATTACTGACATGACGGGACCTTTACAGT^^ 

AAATGACAAAGGGTCAGGTOGTCTTAAATTCCAAATCTG^ 

GAAACGATTTAACTAGCAGCAATAAAATCTCTACTCTTTTTATGA 

TCTTCATATTTGGGACanTTTTrAAAGGCT^^ 

TACCTTTCTGTXXCACCS^TTTATTTT^^ 

ATCCTTTCTCTAAAATATATTTGCTGAAAACTAGGAACSTO^^ 

GCATCTTGAAAGTAAAACTGCCAAGTrATTTTTTTGTIX^^ 

CAAGGGGAGGCTTAGCyUirraATGTATrTTATTT^ 

GGAGACmSGAAGAATCTTGCTTTTAAAGCGATAGTCCTATTT^^ 

AAAAAAAATGTTTTCTGGTCAAGTCACATGCCTCTGA 

CCAGACICCTCy^TACCCCTCCTAGGAGACAGT^ 

TCACTCTTAAAATOCAGCCriGTCTTrGGW 

AAATATAGTrrCGTlTGGAAAACAAAGGGGTGAGGTGGGTCCAAAGT^^ 

TTTCCAAAGCCATCIGCTAATCCCTGTGATTTAAAAATGT^^ 

TTTA(7rcATTAATTACCAGGTTTCTCTATT;^ 

CCCXACCCCCACCCCCCTTTAAGTAACAGTGAOXyU^^ 

TGTCCGATTCXrTATCrrcGCTOGAAAGACTGCa^^ 

TITCAAGGTGTCCTCACITQGTGCCTTTXSAAGGGT^ 

GCGTTMTTCAGTTTGTGATAAAGATCCCTTITi^ 

GTTAGATCTGGCATTTAAlTmCCACAAAACCTATCX:^ 

GTAAGAATATTCAAGGTGGTATACTAAAGGAGATAACTATTT^^ 

GAGGAGCTCAATOCTrATTTAAAAGATAAAAGGTCACCTAC^ 

GGAAGCTGTCTTACCTGGACATATTAATITrAAATTT^^ 

GTGTGAGTTCTXSAAAGTCTGTTIATATCTGTTG^ 

TTrrATATrAGGTCTGAGATGTQGAAAACAGAAATCIAAC^^ 

TGGCTCTGATATTATAAAATCTOGAGGTTTGGAATG^ 

TATTTACMATGTGGCATTAAGTAGAACAACGTArrcrrTG^ 

GGACAGTGAGAACGAACAGAAriTCAGAGACATGTTTCTTAAC^ 

CAAGTAGGAAGCACSGCAGAGTTGCAATATGAAGAACAATT^^ 

ATATCOIAGTGTGTGTCGTCACCAAAGCATAATGAC^^ 

GAAATAGGTAGTTGAAATTTTGOCCrroGACAAGGG^ 

AAAGACATCTCCTICIXATATGTITAGGCAAGCGTTGAT^ 

AATTAGATAATACAAAAATAACAAAACTATAGGACSGTTGAGGATGGGQCTGGG^ 

GAATAAATTGTCTATCGTAAGGG?\ATTCTITGGTGCATT^ 

CCCTGGAGGCCTCCCTGCCCTCCntXTO 
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CTAAGCAAGACCAAGAAGCrrAGTCTCJATTTCy^^^ 

GCGTTAGCCAGCCTCTGCATGATTTTTTACGCAAAACAAACAGT^ 

TCAOTATGGTGCATGTCGCTGATGGTTGGCriXX^^ 

CTCTCATOXATTGGCTGAAGCAAATCATGTATACAAGO^ 

CAGCCATACAGOTTAGCCGACAGATGGCSGTCAAGAAGATIX^ 

TTGGATXXTGGTGTGCACCTQGCATATCCITTAGGAT^^ 

TrACCTACTCCTTTGGQCX::AGAATTTGCrT;^ 

TTCTAACAAAGAQGTGTAAATGGflCAAGAATGy^AAAGCAATT^^ 

ACTTGGATGTTGATOICTAAAGGAGGTGAAACT^^ 

ATGAAGGAGGGCCCATCTCTTACGGCAGGCAGCGTGGCGTAG^^ 

GCAGGTGCGGGTGGCTCTTGGCaCATGATCTAGGTTTT^^ 

AGGCCATAGTGCAGTGTACAATCAGAlTTXiATTGCAGC^^ 

CAAGTAGGACTACAGTTCTQCACTACCACAi^^ 

GCCCAGGCTGGTCTXrAAACTCCTGGCX:^^ 

AGa:ACCCCACXX:AGax:ATGTTAATTTTTlCA^ 

GATTAGAACCACCGTCTCTAAGOCATTCAGCATAGTCCCIT 

TGTGTCnCATTTTTTTCCAGATGCTCTGGGGTC^^ 

CTTCTCAGAACACCACTCAGTACTTCTGGGT^ 

CICATtjOTTTCTGCCCTCAGCTiCICU'ItXA 

CCrCCATGCCTTTTCATGCTGGTGCCCATC^^ 

AACATCTCTAOTTCTCCGGGCCTAGGGCAGGGATCACT^^ 

TGAAGTOTGATCXrGTCSATTIXnCCCXATGCTGTGT^^ 

TACAAGGCCCX:TCXrrcAAACCAGATTC<TOTCTAT^^ 

CAGlTOX:iCim:CXTTACAAGGGACATTTATCX^^ 

ACXrCATAGTGGGTCnTCTCCACTOCnTCAAa^CCTG^ 

AAGTATTCCTAGAGATGGCTGCAGAATACCTGGGAAAGCTAGCTAAGCAGTGA^^ 

CGAACTrrCXrAOCCGAGAATGCTTTCCAAAATGGGAACAftGA 

TTrcTWSAATTCAGAAGTTOTCAAATTITAAGAAGAGAA^ 

AGCCCATAGGCCAAATCTGGOnG'lU'lTlXnT^TAAAGTTIT 

GATGGCTCKnriCAGCTACGTTGGCAGAGCTCACyr^ 

ATCTCGCICITrcaUSAAAATGCGCTGAAC^^ 

AATATCTCACATCAGAATGAATCSAATATTCACAC^^ 

CAAATGGTTATTCAAAGAACTATCTGTCTTGACAGCXrr^^ 

AAATAGATGCACAAAGCMAGQGAATCAAGTAGAATAACATGGTTrATT^ 

CATGCTTATCATATCTTCAATCCArcATTTATT^^ 

ATACTCTGGCTOGGGTGAGGGCGGCAGGTanCTGGGC^^ 

TCACnCKWVGCTGACCTTGGGGAGCAGATrc^ 

TTTGAGCTAGAGAGACy^TAGGTGTATOGCAGATGAGGGACCy^ 

TTGATACTAACGCTACATGAAGAGGGACGGGCXrAAO^^ 

TTGACAGAGTTGTCmTIGTTTiaS^^ 

CTTCnTACATAGGTATACATGTGCCATCGTGGTTTGC^ 

irrTCCATAGCTCTTIOTGATGCTTCXriT^^ 

CAGCTCCCTCXCrAATCTTGerAGAAGGTAG<X5C^ 

AC7\CTCTAGTGAAATTCTATGTGTAAAATAATTT^ 

TATTTTTTAGAGGTTGTAAGCTAAGTAATAACATTCTAGAAALn 

CGC17VATG<X3AGATCCTAAGTTTCAQGCTCTGCTACAGA^ 

CCGCGACTCGGTACCTGGTOTATAGOTOQCTAAAAAGAAGTTGTTGGGG^ 

ACACGTAGGGAGGCTGAGGCAGGftGGATCTCCraAA 

TCTACTATAAATTTAAAAATTAACTCSGGTGTGGTGGTGCACAT^^ 

GGCAAGAGAATCCCCTCAGCXrCAGGAGTraSAGG^^ 

AGCAAGACO^riTXXIAAAAAATAAATCAAATAAATAAATACATAATAAATAAA^^ 

GAAGAAATGCTGTTITAGGAAATCyUVTAGGATAATTT^ 

GAGTCXaGTGGCACAGTCTCGGCTCACTCTAAG 

AGTTGGGACTACACXXXXXCGCCACCACACTC^^ 

CAGGATQGTCTCGATCn<X:TGACCTCGTG^ 

GTGCXTGCXXIAGGATAATCriTTAAGAAGAAGAT^^ 

GTCrrrTOXX:TGGAAAGGGTATTCGTAGCAAT^^ 

GCACACACACACTTTKTIXXnTm^^ 

ACAAAATGCCATGGACTGAATAATATATAAACAACGGAAATGTATITCT^^ 
AAGGCACTGGCAGATTCAGTGTCrreGXGAGGAT^^ 
AGAGGCAACICTCTGGGGTATCTTTTAAAAGGGCACTAATGGCCAG^ 
GGGAGGCCGAGGCAGGTGTCTCGGTGGATCACTTGAGGTX^ 

TGCTTGAACCTQGGAGGCAGTGGTTGCT^G^ 

CTCCATCAACACTAATAATAATAAAGTAAATAAAAGGGCACTAACCC^^ 

ACCTCCCAAAGTCCGCACTTCCTAATGTTACC^ 

AGACCACAATAATGTITrATAATGTTCAlTITXXrm 

ACAAAACXX7«XGTGGTGATTTTGGCAGCCIT^ 

CTGTTCAACXXrrcCACATGAACTGGTCAGrrTAGT^^ 
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CCAGCACTTTCXXyvGGCTGAGGCAGGTGGATCACT^^ 

AGAATAGAAAATAT^AGGTGGGCXmOTGGCGTGCACCTGTGGCCT^^ 

TGAGCXCAGGAGGCAGAGGTTXSCAGTGAGCCGACATTGCA 

TTCAAAAAAAAAAAAAAAAAGTGftCCACACTGTCKX^ 

ACAATGCTCAGTGCAGTGACTTTGCCGAGCAC^ 

TGAGGAanWITCTOKnX?IT(nX3\^^ 

CnCCXXCTGCTCXTTCAAGCTTGCATOX^^^ 

TTtnGCATGGCTClCTTTTCTGTGC 

GAACTCCXOTAATGTTTATTTGAATGATQGAATGAAGGTGTC^^ 

TCTTTGCCTAGCATCTTTTCATGTGCACATT^^ 

TATATTATCATTCATATGAATAATGTrAAiCJATrAGTGTTrATT^^ 

TATTCATGTCOTCTTGAATTAAATTTTCATGGCAAGC^ 

AGGCTK3GAGCX:AGGAAATGACTCACTGAAGGCCACAGCAC^^ 

CCATCTGAAATGGAAAAGCACTGACGCAGTTCTCTWrTCrit^^ 
CAGGTGGCTTAAACAACAGGAATGlTiGTTCCIC^^ 
CATAGATCAAGGTGTCAGCAGGGTTGGTTICTCT^^ 
ACAlGGTCTTCCCrCTGTGTGCGTTTGCATCCOT^ 
CCATATAATGACTTTATTTTACCTGAACTGT^^ 
GGTTTGGGCTTCCACATACAAAGriCTGGGACACAGTACGC^^ 
CCQQGGGAATAAAGCTAGAGTTGCrrTAATCXriTCTAATATGTATAATG^ 
ACCTGAGACCTGAGAGGAGGAGAAGGAAGCrnS^ 
CCCAGCTGACAGGCrATACGCGGACAC(nTO3ATGTQC^^ 
AAAGGTOCAACTCCCACTGCTGCIXXAAGAACAAAGAC^ 
CATTrAAGTGTTAGGCCAGGTGGGGTCGCTCATCnCTCT^ 
AGGTCXXX3AATTK3AAACCAGCCTGGCaWW^^^ 

AGTNTATATAAATATATATATATATATATATATATATATATATATATATATGTAQOCATGCATG^ 
GTCCCAGCTACTIKKSGAGGCTGAGACATGAGAATCGCl^^ 
TGCACTCXy\GCCTGGGCAACAGAGCAAGADQCX:ATC^ 
ACACACTCCAGGTGTATATCTCAACAAATTIOT 

ATACAATTTGlTAAATACAGAAGTAAAroGAGGCAATAGTGAT^ 

TGTCTGTAAAGCCTWjCACTTCGGGAGTCCMGGTAGGAGGGT^ 

TCATGAGACCXXATCTGTATCCAAATAAAAGCAACAAAAAAATGAGACATGGG^ 

AGGAGGCTGAGGCi^AGAATCAClTGAGCAAAGGAGGTrGAAC^ 

ACAGAGTGftGAACCTGTCTCCAAAATAAATAAATAAATAAATAAAAATA;^ 

CTCAGrrCCTGTAGCATGTGCACTACTCCTGATTAACC^ 

TTTTCTGCCAAATGGAAGATCATCTCTAATTAATAATGAGTTAG^ 

GATGGTC5TACCI7U3TAGCTAATC3GTGAGTAGGATGAAT^ 

GAACTGGCTrAACCAAGAAGCAGCAAGCTGTTTTAATT^^ 

GGCCCATGGCmCTCCTAAGAACCTGCr^ 

GGOTrGACAAGTITXXTCCAACAGACTCGTGTAGAAGT^^ 

AGGCCGAAACTIGGTCCCTGGATGGGAGAAGATAGCTGTATCATTGGa^ 

TAGACACTAGATAGTGGGTAGGGTGGGACAGTTTTATGTCCTC^ 

TTTGAAAATTTGGCACATTTTAGTATTTGGTCAGTC 

TrcAGGAAATATGOTrTTTTTTTTCCTTTT^^ 

GTGAGCTCritXriGGCAAGTGTGTCAAATTTATAAATAT^ 

AGTTTATnCTTCTITCATTrTTTCATGTCCAGGl^ 

TTCATTTACCX7«jATTGTGTTTCTX3GAG^ 

AATCCAACTCAAAGGCTGTIGAAAGGCTCCTrC^^ 

CTGCTGCCACCTITCGTCXIACAATITAT^^ 

TTTAACTCACAanGTGAAAATGCTOGGGATCrGC^^ 

GAATTCTACTTAACIGGCATATCATTCTGGTTAACTGAACT^^ 

TAACXrCCTACTCACCACCCACCCAGTTCCTX:^ 

rrCACCTCTCCCCTGGTAACTXyVTTTATTTTO 

TCTACTTAGATACACATTTCXIATITTACrTCTAT^^ 

TTOCTKXM^CTTTAACCTTCTGTTGTT^ 

AGAGTCTTQCTCIGI'IGTCTAGGCTGGAGTCX^AGTQGCAC^ 

CATTCrCCTGariCAGCCTCCGGAGTAGC^^ 

GTAAAGATGGGGTITCACCATGTTAGCCAGGArayiC^ 

GTGCTGGGATTACAGGCGTGAGCCACCGTGCXrAGCC ri^^ 

TGAAATTGTCCIGGTmTXrnGATC?^^ 

TCX3CTACXXy\GTT(?ITAGCT3CTGAA^ 

CAGATAGTTTTAGTAAAACCTTGAAAAGATATGACTATTG^ 

CTCrcCTCXrACCAAGTTTCTAAGCTCGa^ 

CGCWSTGGCTTCATGCnTGTAATCCCAGC^^ 

AGCCTGGCCAACATGGTAAAACCCCACCTCTACTAAAAATACMAATTAGTC^^ 

AGGTACACAGGAGGCTGAGACATAAGAATCGCTOIAACCX:^^ 

ACTCCAQCCTGGGTCACAAGAGTGAAACTCCATOT 

CAGACGGACCXTAAGACGGCCCCAGTGATACCCC^^ 

GAAaxn'AGGTGGGACATGGGACTTACTTCrrcAa:AATAG^ 
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TACACTATATCGCAAAGO^GGGATITIGGAAATACCATTAAGGT^^ 

QGGATCCTCCTGGGTCGGCCTGACITACTAAOGG^^ 

TrGCTGGCrrKyUU^AAGTAAGCTCXXn^^ 

GCCJWXAAGAAAATGAGGACAITAGTTXTATAACC^^ 

TGATTCACCATCXrCCAGTCAAGCCTCTGATGANGAC^^ 

GGATGGAGAACCX:AGCTCAGCaVTGCX:NCAGfiNC^^ 

TTTAAGCTGTTAAGCTTTTGGTAATTTGTTAGATGGtKrAAC^^ 

ATCTTTCTTTGGCyVTTTOKXrrGCACAGC^ 

CTTATAAGAAACCTGTACTCTTTGGCX:AGACATGGTa^^ 

AGATCACCTGAOjTCATGAGTTrrcGAGAOCAGC^^ 

AGCCAGGCATGGTGGCGCATGCTTGTRGriaXAGCXXSACT^^ 

CAGGGGTTACAGTGAGTQCGGTGGCAGCTOGCSACATrAACC^ 

GTCACTGCACCTCCAGCCTGGGCAACAGAGTC^ 

TArrrTTCATTATTITCX:ACTATGAATGAAGAAGAAAAT^^ 

ACACACnnCCACCTACAGAGTCACATATTCTATAGAATAAAAT^ 

AGTCTTTGCTCTACAGGTCAAGTTGGGACTTAC^ 

TATAGATCAGGAAAGTAAGAACCCACAGCATCTTGCACGTAG^ 

TAACAGGTCATATGATTTCX:ATTTC7ITrGCACA^^ 

TGAAACCTTGTTIT'ATTTAriTATTTICrigAT^^ 

TGGGTGCCXXIAGCATTATCnTCTAAAGGACATGTAT^^ 

TC^VCXAAACCCTAAATAArroTCTGGATGTTC^ 

TAGAGTGCATTAACTTCTTTATGACATTAATACTGACT^ 

TTGCCCTCACAGGACAATATCTTTAATAGCITCTGTGTG^ 

ATACSCATACTGAACCCATCTAACTTGGCAGAAATrcAGAAAATC^^ 

GGAAAAAACCCTCACTTTTATTGCAGCAAGAlTTnCICA^ 

TTCTCCAGCACCCTGGCTQGAAATTATITATT^^ 

TCTGGCTrGAATTTGTTTGCCTAAAGGAGCAGATATC 

TGTTCACCTGGAGTCTGTACACytflAAAACCrTC^^ 

GGTTAATGTCCCCGCTCACCXTCAGTT^^ 

GCACTCCnTK5ATTTGTAGCCGCCACACATGGC7\^^ 

AACTTTQCTGGAGAACATCACCAATCACGCICAGGGAl^^ 

TTTAGGCTGGGGTCACCGGGGTCGCreCCCCAGOCCCACTCn^ 

CTCACTAAAGGGGAGGAATCCCTGGAGCCTCCCCATCCAAGGCCA^ 

TTCACCCCCACACAATCCACAACCTACTACTAGAATGCCATAT^^ 

ATATTTTATITITKiAAACAGACTTCGrrAGAACATC^ 

TGTCTGTAACAGCCAGTITATCCTATTATTTrAACGT^^ 

TGAAATATTAGGTTGTGCAAAAGCAArrGTGGTTTTTGCCAl^^ 

AGTAAATCn xJi ' i ' i ' i ' iM XXn'ATTAAAAGTAATGCGGTITTAGTT^^ 

1 1 1 ttgccag tac t taaaag taa tggcaaaaac tgcaoAAGTAATTGGCAAATGGCAATTGCXXSTrTTTGCCGTrACT^ 

AAATTGCATTACTTAAAAGTTTTTAAGTAATGCAATTACITAAAA 

TTACTTAAAAGTAATCGCAtJl'l'iTiXXXATTACTTAAAAGTAATGGCACC^ 

CTAATATTGTTATTAACTAAAGTCCATCGTTTATCATATIX^^ 

TCTGTTay^QGAGCCXATCCAGGATACCACTTTTAT^ 

AGGATXXSAGTQCAGTGGCGCGATCTTGGCTCAC^^ 

CTGAGCAGGTGGGACCACAGGCATGCGCCACCACGCTC:^^ 

TcrrxxrnAGCATcriTC^^ 

TTGGGTATTCTGTAGGATGCCCCTGTC5TTCX3GAaTT^ 

GGGAGGAAGACCCCAGAGGTCTCnTTTCATCACATT^ 

TGACTCTGATICACCTGGTCAAGGTAGTGTTCGTrAGGl^^ 

TGCTGCCTrTGGAGGAGTGTTGCTGTXX;CCA^ 

AGTATCTGCATAAACTACTTGGAATTCTACATGGAATATITG 

TATCACGGTACACTCATGGACATTTATTTTATGClTrATG 

TTTTTCCAGATGGAACAATrrccnCT^^ 

GTt3GTX5TGTTTTAATCTITGGTTTTGTTATCAC^ 

ATANCATACACAAAGGGCNAGCArajCTACTATAAlT^ 

AAAATCATACTAGAGCAGTAGGCAGTCTQGGTGAGTTCnOXA^^ 

AGGccACTcnrcnxxTKnxsccccy^^ 

CCTAGTCTGGTCITAAAGTCCCAAAGGCyiGGCCATCAC^ 

TAACccaiAACTiaxx:GATrcyicmT^ 

TCriXXAAGGCCTClTCCAATCTTGACA 

ACCAATGAGAGTTCAGGTGTTCTGTTGGAAATCTTTOT 

AACTCGACACATAAATAGGAAGACAAAGAATCAATGTGGACTC^^ 

AACTTTGATGC7^TCCI03GAAGAClXr^ 

AGTAGACAAAAGGTQCAGAGTAGAGTATGCAGTGATTTCAGGGTG 

GTTA^nCAACATCAAATGTCTGGATCAAGAGAGGCAAGAATC^^ 

TGTTrTCTGCCTrcAGTGTCACCTGTrc^^ 

AGGGGTGAAGAGTTCAGTAAACTGTGTTrAACCCAA£3GAAAGATAC^^ 
GCAGTAGATAGTCGCCCTACAGTGGAATGAACCTTCACAC^^ 
CTGGATGCTTGGAATTGlCAa<XX;TGCCXncI^^ 
AGGCXriCIXXXrrCATGTCACATTGAGTGGTOrrA^ 
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TCTCAGCCTTCXAGCAGAAlCCCCTTTCym^ 

GTCTCGGGTTGGGGGATTGTCSGGNTXXrCGAGAATCAG^ 

CTGAAAAGTACCCACCTAACAGACTTGCCAGCTC^^ 

CATGCAAGGATAATTACACAGAATAAGCACTGAGATGCCIGI^^ 

TCTTTGTCTGGAACTTTGCAAAATAAACACCATGGT^^ 

TirrAAATTCAGGGTTAATITCTTGCTTTCAAAT^ 

GATGCCGACGACTAGCCAGGCGCITCCCTTCCAGCT^ 

ACTTGATOGTCCCAGGCCCCCTCXriCATTC?!^^ 

ATCTGCAGAATTACTTOXATTCTCAGGTCr^^ 

CAATAGACCATAACTCCAGTGTTTTATreTGCTTACATAT^^ 

AAAATGAAAACTTTTCAAAGGCXACXIACTGCTCGCATTCC^ 

AAGACTCTGAAAACAGGAAGAClTITKriCTT^ 

TrcGTGAGGTTAGGNCCTrariTrGTCTC^^ 

GTCX:CATATATAGCAGTGTTGTCAGTAAGATTTCTC^^ 

CNTTCmTCAGGCAGAAGTTGGmGTCATTTCT^ 

CCrGCTTAATCCCTTCACTGGCTCACXATATCXCTTAG^ 

CCAACCTCTGCAGCTTACTTnn'ACCACTTTCCC^^ 

AGCXnCTTGTCTCTXSGCTAGAGTAGTCTTCC^ 

GATCATCACTTlCTCrGGGGTOCAGGGGTCTTTCATAT^^ 

AATAGAAAAAACTAATTTAGAAATAGAAAATTAATTGAGTTITAff 

AGAAATACyVAAANTTAACGTAGNATCACAGAAAATTATCCAAAGTC^ 

TCCrcCCCACrrcTCITCCCTGCCl^ 

CAAGGGCAGGGATTTTAGTCTATTTCATTCAAAATCTGTCArc 

TCAATGAATGAATGAATGAATGAATGATGATGGCGTGATGA 

GGCACTGGGGTCCTATTTTGCTCACTGTTCXr^ 

(nxSCATCTGGGAACrTGGAATTTAGGACXX^COTCT^ 

CTrGTACCAAAAATAGGGCTCAlX?I<X;AATACCTGCTT^ 

GTGCCTACGTGGCCAACTTCCAATAAAAACTGTGGACACTGAGT^^ 

ATATTITCACAGCTCATTTCrGGGGGACTAATrAAATGCAT^ 

TACCTOSrmmiCTCCAGACTO 

TGATAAATCrcAGCCATGAGTGTCACCGTATGCTGA(^^ 

TICAGTGCCTGACACAGTTGTATTCXrrAAGACATAGrrGG^ 

TGAACGAAAGACGCCATACGCAGTCACCCCCCCCTCT^^ 

CAlTCGAACAAACGGAGGGTGGTATCriGTACTGAAAAT^ 

ACATAACXrACTATTTATAlCGTGTTTATATTGTATTAGGTATTATi^^ 

GTCACATACATTATATCCAAATACTATACAATITrATGTrAGGGACTT^ 

TCXrriGAACCAGTCCXCTTTGGATACTGA^ 

ATCTC^TCCACCCCTTCCTCACCCAAGATIX^ 

GGTITGCl^GACAGAGCTGGACAGGTCXIAC^TXXATT^ 

AQCTATGGCAAACTGCGGAAGAATTACXSACAGATGTATGACTTAaT^^ 

TGTOCTITATTCTCAGAaTATGTTCAACAGATATT^^ 

TCTOGCTCTGTCACCCAGGGTCTAGCACAGTGGCACGAT^^ 

TTCTIXXCTCAGCCCTCCAAGTAGGGCTACACGT^^ 

ATGITGCCAAGGCTAGTCTCTAACTCCTGGGCTCAAGCAAT^^ 

GTGAACCACCATACCCmGCCAAAATATTCTTl"lCAAAGTGNATAAGA 

CTTCTOTCTTACATGCACTTTITCATT^ 

rmAAGTCTGCTTACTCGGAAAAAXAAAACAAAGTTGAC^ 

AAAAGTCTTCnTXXnCCATGAAATCCAGAAACTGGAGACC^^ 

CAOmGAGATCAAGAGTCJVGAAATAGCTGGGTTGAAT^^ 

GTCTrTCrcAGCTGTTTTTATATCrrGTACAA^^ 

GTGAGGAATAAATGATATAATTCACTAAAACACGTAGAATAGTATTTG^ 

CAGANGTCATrATTGAC2^^rITAlTCrGCTA 

GATCHTATITrTrTGAAGAGGGTGTAGGCiyuVGAAAGGGt^^ 

CCTCCTCATACCACAGACACTCTCrr^^ 

TCICGTTCTGTXnWCCCAGGamK^ 

AGGGATTNCTCCTGCCThKyVGCCTCTCGAATAGCT^^ 

OTIAGTAGAGITGGGATTTCTCCATCTTO^^ 

CCCACAGTGCTGGGGTTGCAGCTGTGAGCrACT^^ 

CroCTACGTGTAATAGAAAGGTAGGGCCATGriTGGAGAaGGATC^^ 

TAAGGTtXXXAGGCymxnXSGCTCATGCCT^ 

GAGTTTGAGAGCAATCTGGCCAACATGGTGAAAC^^ 

GCCIGTAATOCCAGTTACICGGGAGGCTGAGGC;^^ 

TCGCX3Ca\CTGCXXrrCCATCCTAGGTGAO^^ 

GGArra:AGAGCATAAGTATCTTCTATGGQGCTACay^^ 

GGATTXSCAAriTTOATTCACCCCTCCC^^ 

GTATGACTITAAAACAGAACXyOSAGTTCAGAACTrT^^ 

TGGTlTCCAflCCTITCACTGTGCGGGTT^^ 

CTCCTCCTITrcATAAACATTGCTGCAAGGCCCXS^^ 

TCGCTGTCAAGTGCAAAGTTXXSGCTGGCAGGCCl^^ 

GCTOEK5GTGTGCGGTXXWVCCCGGGCTGTGTTC^^ 
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ATGGAGGTCAACTACGGGAAGCAAGQGAGGAACCATCTGAA^^ 

GTCCACXrrCIXXrr<yvCCAAAGGCTGAGTGa^ 

CAACTGACAAGCCAGCTGTGGGGTTGAACCCTGTTATTCAT^ 

AGATGCAGCrrATATGACTCrrGTACCAGAGAGAGACC^^ 

TTTmX5TCnCTAAATACAAATGTCCftGGAGAATAGGAATGCCAT^^ 

ATCACAAGTATCTCATTATAAATGGATGCTGAAATGICCAAGTX^ 

IXXTTAUClC ' lTiCACTTATTCCCTCATTC:^^ 

ttctagggacnaqccgggcgcagtgqctcatgcc^ 
tcaggnagttcgagaccagcctggccaacatcxstgaaaccccgt^^ 
gcgggcgcctgtaatcccagctattcaggaagttgaggca^ 
ccaagattxxxxx:actgcactccagcctqggtc^ 

tcx3agagagagagagagagagagagagagagagagagtgt<nctgt^^ 

tacgctcxnx3ttraggtgggcttc5agaaaagtgctk^^ 

gcctgtaaataaattcataccnaacttcagccccyvtgct^^ 

CGGGACNATATTTCCCCCTCTTANCAGATACXSAAACCTT^^ 

AATGTGCCTAATGCCAAAACTGTACCTOTOCCAC^^ 

TCTGGCTCTCXXXrrATTAGAATTAGATGCGTGAGACCXri^^ 

CiayrTGTCCX:TITAATCCATCGCCCAAAAAGCCT^ 

TCCTGATTAGGGGACCTCTGCGACTGCTCATCTGGGA^ 

AGGGAAGGAGAATTGTGAAACATGCACAAITAGAAGTCGTGCACCTGG^ 

GCGGGTGGGGAAGGCAGACAAGTOXTrcATCCTCTtr^^ 

TCriTATTCAAGGGTTGrrGAAAGGATGAAGrroCT;^^ 

ClCACTACTTATCGCTGlCAACATCATCrrCXXn^ 

TTrarCTTCAGCTGGATGTCCATGTTCX;^^ 

GCAGATATCAATTGGGGGCAATTGGAGGGGAAAAAGATGCTTTCX^ 

CaGAGCCJ^GAGAGGTCACTGTCXy\AGT70^GGAAa 

ctgcx::aacacgaaggtagatggacggtccitatgctcgc^^ 
ctctagtagtgattaaaatgcctttgacaggctcttgc 

TATAATGGAAA(XAGGTTATTATTGATAAGGACTAGACATATTA 

GGGGAGGGAGCTAAGGGTTATTGCAAACCTACTGTGTGTCAATTGCT^ 

GACCXXCACCrrmGAATWAAACCAAGTTrGGCCT^^ 

GGAGATTACAGTAGACSGCATCCAGTTGATTGGGGAATAGT^ 

AClTAATOGAGGTTCTGCrcriOTACTAhr^^ 

TCAGTCXriTITGCTTACTAGTAATGACCCTCT^^ 

TAGCCTCAAGTTTCTGGAATTCTAAGAGGGCCT^^ 

AAGAGGCATTCAACCTCCTGTTAATTTGGTTCCrCCC^ 



TGGTITGGACGCGTTCCCCTTGTCTCnXXyUlTT^ 

GTCCMTCCCnxnCATCTGCAGCAC^ 

ACTGCCTGCATTTTCTAATCTTCACANNATAACT^^ 

CTGAAGCGGGCAGATCACCTGAGGTCAGGAGTTTGAGACCAGC^^ 

TTCrCATATCTACTAAAAATACAAAAATTAGCTGGGCATCXST^^ 

CAGCAGAGriCKnTGAACX:CAGGAI3GCAGAGGrr^ 

AGTGAAACriCTGTCTCAAAAAAAAAAAAAAATraTTT^^ 

ATGGTCCACAGTGATCTAACTCAICAGTGGTCCNACC^ 

NTCCrcTTTOCNTTACTCANCTGACNATCAT^ 

TKXrrciATGCTCCriXSATGAAGGGCTC^ 

AACACCrriQGTGQCAGCTAAGAGTGTCAAAATGGAGGGATC^ 

CAGGGCCTATTGTGTTGTGTGTGTCTCC^ 

TAGATKTCNAGAATCACTTTCAACTTCXrr^ 

CCCAANTATCAQGCACNOTACCCTOACACTC 

GCTOSATTTCTATTCCXATrTTCXyvC^ 

GGGGCAGAGTCAGGATCTGAACCXZAAGACATAGGCCCAa 

AGCaCTTTTCATTCCCTGTCCTITCC^^ 

TTTTCTTTCCTCTQCAAGCAACATC^^ 

TCCTITTGlCI«XXnX5TACCTGCX:TT^^ 

TGCCAGTTGACACCAGACTAATGCTTCTGA^ 

AGAGGAGAGATAGAGCACTICAGATTCCACTCGAATACATTAC^ 

TCTCTGGGTrGQGAGGAGATCTGGCATTTCTAGGAArrcAC^^ 

CltXTOTGAAGATTCTtfSGAATTTT^^ 

AACCTGAGAGATATGAGTXn!AAAACAAGAGAATGATTC(^^ 

TACTOGGTATCAGQGCTGCTTTCCTlTATAAra 

AAAAGCTOXKXXZAAAAGACCAGTCriCC^^ 

ACCAAAAGTCTGTTCAGTITGGAAGACTTGAC^^ 

CXyVGGCATCAATCACACATGCAGTGTGGGCCCAGG^ 

TACAAGGAGACCQGCACXrPGGGCTCACACCTTGCC^ 
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ATTATTCTCTCACrrrcCAGATGAGGAAATGGAAGTC^ 
ATTGCTGAGAATAAACCCGGAGTTGCCTCTGGGACSGACC^^ 

TCriTGGCrcJ«riXX:AACCTCT^^ 

GCCCGCCACCACGCCCAGCTAATITITTGTCn^ 

CTCCTCACCnCAGGTGATCCACCTQaXCA^^ 

AriOCTIX^rGTGTGAGGATGAAGAGTAATAGGAAGATAfiA 

GGTGTTCGCAGGGCMCAGCTCGTTTOCriTI^^ 

AGATAGGAAAGTGCCCAQCAACCCCAGGCAGAATCATTCTCTr^ 

GTATTGAGAAGAAATCXnrCTTCAGGTITXSGAG^ 

TCATQQGTCnXSTGTGTQTGTGl'l'l'lCTlVlUXSGTG^ 

ATCX?rcACTTTTAGCGAATGGATACTATTGAAGGAATGT^^ 

GGAGTOCCi^rrcAAGTCGGCCGCTCTG^ 

GGCTCACGTCTCCACATICTGCACCTCCCAGGCTC^ 

TTGCATCCGAAACAACCACCCGTCCTCAATATGTAGCmy^TACGT^ 

CCTATGTAACAAACXn'ACACATanXXACATTGTACCCT^^ 

GTGGCTCACGCCTGTAATCCTAACAATITiaKS^ 

CTGGCGCAACACGGTGAAAACCXXGACTICTACTAAAAATAC^^ 

TCCCAGCTATICGGGAGGGrGAGGCAGGGGAATroCTTGAACCTGC^^ 

CTGCACTCXIAGTCTGGGTGACAGCGCGAGATTCT^^ 

ATGGAAGGAAGGAAAAGGGAAGGGAAAGGAAAGGAACCAGCTtXrAAAAArrcTCCr;^^ 

ACAGCTGAAACACTCCTCAAAGCTGAGAGGAGGGTTTGGGAGAG^^ 

AAAAATICICTTCCTTGGATTTCAAAATCAOCCrc 

TAAGTTTCCCCCAGGAGATTTTTTTITrCTA^ 

TATGACAATAGGACCCATCCTCGAGAGAAAAAGCCTTCTI^ 

CACTTGTAGTGTCAGGGTTATCAATATCCGTCKSC^^ 

CCTCTCCOO^CCCAATCCCCKTCACGCCOT 

GTAAATCAGCTCATTCATrcTGCTGGCCCTAAATAGGTA(^ 

TGCAGATGCCCGGCTTGTGGTTAACCCAAGCAAGAGGTAGTTAAACC^ 

TAAGlTrATGTATGGATTAACACAGTGATAGA(XXrK^^ 

CACTCGCCTQGGTCATTGAAATTAATTTCrOCT^^ 

ATTAACGGGGAGCCGAAATTATTTCrrcTXSCACAGGCAGTA 

CGGAAAAAAGGATTATTCCTCCXXriTTTATCAAGAGCCC^ 

AAACAGCAACGAAGTTATreGAGAGTTAAAACGTAAAGACICCAGTA^ 

CTGTCCCXXTWVTAATTGACAATTTer^^ 

CCTAATCCAGAGATAAlTKnX3AATATCX:ATTCT 

AGGAAATGGGTQGAGAATGTAGGGAGGAGGTGGAGGATAAAGGATAGGACTAGGAAT^^ 

GCTTCTGTCTCXAGATTCCTTGTTC^rGTCCCTTO 

CTCTTTGGGCTCXrTGCTITrATTTXrGGCCX^ 

GATTCAATGCTOCXTTAGGTCCACAGTarrcAG^ 

TGAGATTTCCGGGCAAGTrcAGTCTCCATCyrTTr^^ 

ATGTTTCCCITCTTTGGTGTCCATGGCAAGT^^ 

GTGGGTGCACATCyWXSATACCTCAGTGGGGAACTCAGGG^ 

aTCTAACTGTCX:AGCTCTTTAGGATTAGGCAATACTT^ 

ACTCATXX^rACTTOGTGTAAAATCCTGGGATGATAGCTTA^^ 

GGATrAAATCAGrrCATGGAGATGAAATACTTAAACTGTXX:Cr^ 

TGCTCTTAACCCCAACTAGATAAGCCAAGATAAGGTtXriCT^ 

TATCXlAGACCCAAGAGGATCXGACAGCrCTGGAAClTGTAAT^ 

CAAAAAAGGACACGTATGCCTAGAAAACATGAATAATrATCATCTATCA^ 

TTGGCrAACGAAAACAACAAAATCAACCGTGACAAGGTTTTATT 

TGCTACCAAGTCACOCATGAGTAGCAATTAGGCGACTTCTAACATG 

CTTCCACATCTCATTAGCACTCCCCAGAGCCT^^ 

GCITATAAGGCTGTTrCATGTATGATTTTTATATCT^^ 

ATATACGGGTrCCTTACCTICTAGGAACTGATTIA^^ 

ACTCTTTGCTTGCTCAGCAGTrca^ 

GCCTGAGACAGTTGCCATAATrAGTAACCTGCAGCTCrC^^ 

AAAAACACAGCTGGGriXXrGGTGGCTCACGCCTGTAAT^^ 

AGGAGTTTGAGACCAQCCTGACCAACATGGAGAAAAOT^ 

ATGCCTGTAATCCCAGCTACTCAGAAGGCTCAGGCGGG 

GATCACGCCATTGCACTCCAGCCTGGGCAACGAGCGAAAC^^ 

AATACTACTTXrrrcCTCnCCTXXXrCACCCTAAGCAT^^ 

CCCCATATGTCACCTtXCrcCrCTAOtXAAACAAAAAT^^ 

GGGATATATAGCGTCAGQGAGCGTATTTTCAAACCAAAAACC^^ 

TCXrrcTTATTGCTCAGQCTGCTXn'ACAGTGGGGC^^ 

cctgcxrrcagcctcccaagtacctgggattacaggagc^ 
tgaggttgcaccatxottaccaggctggtctggaactc^^ 
gggattacaggcgtgagocactgtgcxx:agcxaaac^^ 
agtcaggchtaagaatacattagtaggtacagcaggagagcatct^ 

Tn-AGGAATATAGATAGGAATrACTAAGGAAAAGrGTArrrnxnOS^ 
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GAGTTrrTGGTTCrACTCTTCAATCAGTTTA^ 

GCATTGCATAGTTAAAGTTCrTTAAATTCTTCT^ 

riXXTXSTTACCAGAATCTATCAAGCTCTrrcCT 

rrcTACCCTTGAGGAGGGTACCACCTCCTCATK^^ 

ACCATCTGTGTTTTriTCCTATCAATGCTTIAA^ 

ACCATTCTGGAGGTCAAAAGTCCAAAATGGGTCTGTCI^ 

ATTTACAAAGGAAGGAAATTTATTTGTCTCATGTTTCTG^^ 

GGGCGTCAGGOWCXrTTCTACTCATCXnt^^ 

AAGAGAGGGGAGGGGAGGTXXXIAGAGCXrmAAAAGAACCAACTCTCT^^ 

CATGAGG(yVTCCACCCCCGTAACOTAGACTCCTGTCATCAGCC^^ 

ATTIOSAAGCAACAGACATCCAAACTATAGCAGGGTCTCC^^ 

GGCTTCTGGGGAGAATTTGTTCTGACmr^ 

ATO:TrAAAANCCAGCA(nGCAGCGTCTTCAATIC^ 

GTItXrATCTriTrAGGTTTATTTGTGACCTTITA^ 

TCCTCATGTCAAGAACTTACATCCCTCTACAAAGriCCr^ 

GCATQGATGCXOTTTGNA^^^mGATTm^G^ 

CICGGANCTCOTKXSGNOTKXATCAATO^^ 

NACTTGGCCTTTIWGGCACCATTATTCTQCCrc^ 

TTAQCCCTTCTrTCCTTAGCXrnXTAACAC^^ 

CTCnCACCCNNAGGCTGGGGTGCANGTGO 

TXnaXXnCAGCCICTCTGGTAGCTGG^ 

ACTAGGTITCACCATGTTGGCCAAGCTGATCTa^ 

TAGGATTACAGGCCATrcTGCCTGGCGTATATATTTTAAATTTTAT^ 

CTAGAACATAAGCTCTCTGAAGTCAGGGACCATGCCTAGAATCGCC^ 

GCTATGTCGCGCIGTGTTAGGCTGTGTTGGCTGTCANGTAAGAGAAGT^^ 

AGGGGGGATGCCTIQTGGAAGTGACAACTCriT^SQCACGTAAC^^ 

TCAAGAGTCTTTCTCTTCCTTTTGAACCCAGCTC^ 

OTtSAAGATGTGAAACTGCCCAATCXXXrCMTCTGGA^ 

CTGGGAATCACTGAACCAACATGAGTCACTTGCTACTGl^ 

OITCCCCAAGGTIAATCCAAATGCTTTACM^^ 

GCTGCAGACCATTGTTTAGCCCTCCTTCCACCTGG<^^ 

CTATCATTACACAGATTAGCACATGGlTCCAIXyVTCATGGArc^ 

GGGGTCTGlTAGAGAAGGGAATXX^ATCTrcTGAGGTAT^ 

CCXOTAAGGGTGGGCAGATTCCriCCXXy^lTC^ 

OWirATCAACATCAGACCATCACCrTCT^^ 

cagaixxagagatgaaacxx:agatctaactggacxx:aagac^ 

gtctogitatgtcaagtggaatatttctggct^^ 

gctottatoxttattittctaaaaggcaacattagtaatcaact^^ 

tctgatttgctttcgcttagtattttaratrattc^ 

tattcacccagaacaggtgtgtgccacxr'itggaga^^ 

aggtgaatgaatacattaccacctcccaggagttgaatgaaaaa 

tcnctaaticatgcatatccctatacaaagacxxmg^ 

aatgcgaggcccttcttgtaaggaaacgcixsgtoggt^^ 

tgctttgqgc^aagagittttgtcaagatcan^^ 

tgctaatgagacaccaaagatacacatggacaat03ccanga 

ctctggagcnagtcaactcagaatggtcaggact^^ 

aacgcaggaccaaccagagaaagccacatatgctcxxxmaccaatcat;^^ 

cgqctkxxx:atctccacagcctcc^ 

COTTCTCTCCJUXrrcCTTT^^ 

GTGACClCTATTTGTTATTATTTGGGTCCCCTTTXn^ 

ggcttttcctgtttgtttctttgattgttt^ 

tctcgctctgtcacccaggctggagtgcagcagtgt^ 

ctcctxxxrrcagtctccx:aagtagctgaaattacac^^ 

ttttagittfsagacgggatitcaccatgttggcc^^ 

crtccgaagtoctgggattacaggcgtgagccj^^ 

actwtttaccagatkksagtgcagtggtgc^ 

cacctctgccactrcaataggtgggactacaggtgggt;^ 

ACTCTGTlGCCCAGGCTGGTGTTGAACTCCTGGCXnX^ 

GGCTIXy^GCTACTTCCCCMGCTCTTCXXXX^ 

GCTTACTTItXXTITCTGTCCTTTACTTATGAClX^^ 

rrATTATTTCATGTGTTAAGTCTACAGTGTATAACTTCGGT^ 

AAATACTCAAAGCCTTCATTITQCAGGTCTX^AT^^ 

TCnCITATCriGTAATrcTATGGAGGGAGGTTCCATT^ 

gcctattgtactitatctgtgtaactgtcgtagcagctagagg^ 
agaaaagggagaaaagcaggacaaatgaacxx:agattaaggaaaataaaaaagagtg^^ 
a^ctcatttttct^tg^^7w^cacattac^^ 
atgaagaagtatgtgtatwccgcttrrtgtctttlo^ 

NCrorrCTTAAATIXXATAAGCTGGAGACACTGCCOT 
TGGTGTCCGTGrrGCCTTGCTTCCTAGGCTGTCCAC^ 
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CICAACCCAAAICTAAAGCCCGAGTCGTATGAATCCA^ 

CTCIGGTCTTTGAGArrATGTAATTAGCATATGTIX^ 

AGATTTAATACTITTAAATCAGAGGCCAACATGGAC^^ 

mTATGGAATGGGTTTCCATGCXATCyiX^rcCXX^ 

TTCCCAGCCAGCCAC^TGATGTAGACAGAATATTTCGGATTTAT^ 

GGJATAGAGAAAAGGTGGCITGGCTGTTTGGTTTATGTrc 

TTCGTAAGCTrAAGAATCATAAAGAATTGGCTTTTC'^^ 

ATCTCrnCAACTGGTGGATTCACGAGGAGGTTi^CX^ 

AGAAGGCCACAAAGTTTTCTTATATTTTAAGCATCAAAOT 

CACXnXSTGGOCATCCCTGGCATGCTTAAACAAGAAAGGC^ 

G^K^GCCAGAA0CXrroTGGr^CeITTC^X2ICTACA^ 

CnCAGGTTGAATACAAGCAGCTATTCAGTGTITTACTC^^ 

TCCXTOGAA<nxnGAGCAGATTGTTAAAAT^ 

CTGAGGTCTAGGGCTAAACTCTTGCCTTTACCTG^ 

TTGCTCACACAGACCCAGCTCATCTCCCCAGGCXIJVTC^ 

AGTACATAACTGAGGAGTAGGAGGGCAGCAGCXXAAATCTTTGATGAA^^ 

CGAGACACTGTTGGATCX^GAATCTGGCTTCTAACAAAC^ 

TITIGTCATGTGTAGAATGGGGAGTATTGCTAGCTATTCrCAT^ 

AGGCACAGGGATCX3mX3CCATTATI«:AGTTACAT^^ 

ATGTATTATTNGGCATTCCTAGCaiACTOlTrrAAAGGAAGTAC 

CCAGCTTGANGACCAGGGCrrcCTGGTCACTrAAAAGAT^^ 

ACnTTQQGAGGCTGAGGTGGGCAGATCACGAGGTCAAGANGATCGAf^ 

ACTAAAAAICCAAAAATTAGCTGGGCGTGGTGGTCAGCNACCTC 

CACITCAACCCGGGAGGTCGAGGTTGCAGCGAGTCAAGGTC^^ 

CGTCTCAAAAAAAAAAAAAAAAAGAAAAAGAAAAAGAAAAAGGTCnTTGG^ 

TCC7ySGTATTTCTCACX:GTGTTCGGGGTC^ 

CXnXXACXIACCCATCSACTCTGGATTTATCAT^^ 

ATrrAGGTTCTTCTGTGATAAAAATAGACTTGGTAAAGAAGC^ 

TTTATTCATCTTCCTTATTCATCATATAAGIV^AATTATATGAI^^ 

GCCCATCACAGTTCTGGAGTTCACATATCATTTGTTAC^^ 

ATATACATGGTAGAAGGAGAAGAAAATGAAGAAAAATTTAGAAACTOXS^ 

CACTAGCATGAGATAATCACTCITAATATTTTGGTGCATG 

ATTGGGCTCGTAATATATAGGCACATAGATATATAATTAAAAAAACAAG^ 

AAACNAm:rrAGTAATITCCTAAGAlTATACCATX^ 

GCTA^K3Ga::ATTTA^TUVTT^GGGACACTA^IT^AAAAAATAAC^^ 

ITTTGTAAAAGTAATCAGAGCAGTAATAATAGAaxwrrc 

ATGCTGTGTTCCTTGCCAGTGATTGGTTTAGAAATAGGTA 

GAGGAAGGTTKXTACTCCTAAAAGAGATACACAGGAAGAGAGAC^^ 

TAGCAGTTCGCAtX^TTGGGTTroGACAACX:^^ 

AGAACCIGGGTQCATTGTGGCATTriTAGAGCCACTGAlT^ 

ATGAAATAAGCTGTTAGrTTCGTGCAAAAGGAATTGCAGT^^ 

TAGCACCAACTTAATATTTTGAGTTGGGTTTGAATCACGT^^ 

GAGGTCTGAGOWVCGTCTICACATTTCAC^^ 

TCTKXrrCy^AQGTCGTATCCAGCTITGG 

jkx:aaaatgccitctcctaaatccaggtagangagaaact^^ 
tagaatcagaaagtgtgagttcaaattgcxritosqc^^ 
aaccctqgtrrtcagactctttattggtaaatgaagt^^ 
aaaaatataaaataatcitacaatcatcatoncatoctc^ 

ATCAACTAACroCTGTGTACTATGCTCTATTXSAAAATGT^^ 

CTTATGGTTTITATATITrAATAAAAGTGGCAGAAAAGAAATAGGTAAAACT 

CXXyvCGTTCTTTICAGATTCTGATTTCTCC^^ 

ATCTAAACCTTTXriCTCTCTCTCTCTCTC^^ 

AAAAATAAGATTTTCGGGCeiXyrGAATTTTT^ 

AAACATACTGTOtnriTGAATTITGGAAATAGACCAGT^ 

AAATTTTTTCTTGGTGGGGGGGATGCAimaATGTGTATG^^ 

attaaacattgatcatctatgtgacx:acatitaaaatgagc^^ 

gtacttgcttosgcagaactctgtgccttt^ 

caacxx3tattttacagttaaggaaacntcggctcaaact 

gcagtctgagagtgaaactcaggtwctcac^ 

gtggggcgtgggcactaatttcatttgatcacttagag^ 

ctgagaacttca^tataataacgttcgcttgatc^ 

gatccataatgtititagaagttagaagatttaaatatatxauvt^^ 

(xtgccaaaagaoctcttgacaggaaataaacarraaaat^ 

GATGCACTTTAAT1TK3TGGGGTTITAACT 

AGAATTTTCTGATATGTTCTTCTGTGAAC^ 

AAGTTAGTCTATATrrAQGTGATGTGGGCTTGATTTTCAC^^ 

ACCCCAGCTCTC3TATTCX5AGCX3TGGTTATCTT^ 

TTGCCAGGGGGAAAAATAAAAAAAACACAACATTGGAGTGAQGGTCTAGGGTGAAT^^ 
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AGGGTGGTGGGTAATTCTTCTCTCTTTO^^ 

AACATGAGGATGAAAGTTATGCrCTKnTGTT^^ 

OSATCACCTCTCTGTGCCTCMTXnC^ 

TCACCXrJCTCCAACATGaVTGTCCTTICT^^ 

CTACCTGAGGCTGGCTAATGTATAAAGAAAAGAGGTTTAATTC^^ 

GQGAGGCCTAAGGAAACTTANAAAra«Aiax:AGAAGATGA^^ 

Aa3CAAGGANGCXy\AGGNAGGANGCOTAl^ACCCACI^^ 

TCAGAACAGCAAGGGGCSAAATCAGCXXTrCTGATC^^ 

TTCATGTGAAATITTGGCTGGGACACAAATCaZAACXATAT^^ 

CTCCTTACAAGGGGTTCCrTCa^AACC^ 

TAGACTOATGTGTTTCATCX^CWJ^CAG^ 

TGTICGATACACCGCTCXrTCTGACCATCATGG^ 

GGrnXATCa^GCOTTITCCTTCGTGTACTCATl^^ 

AfflXXmXXrATCATTCATtXAGTTGOUia^AT^ 

CAATGTGTCCTGCTCATCACTGAGGITrcr^ 

TTGTTCTCATTGCCACCCTOTTGCCTGGGC?^^ 

ACAGATGGGAAGAAAATGAGGTGGGGGGAAGGGGACTTTCTTGCTT^^ 

ATCXOTCCOSACTCCCAGGGCCCTCXrAG^ 

CTCCCACTACGATGCATGCTGGGGTCAGTAAAAGGAGAACA^ 

GTTAAAGACGTGACCCTGGCCGGGACIXW^GCCAGAGCXX^ 

ATGACAGATirrAGTCCCCTCTTGAAlTia^ 

TOrrAGCTTQGAAGGTGGCGCAAATAACTCKTITCAGGCT^ 

GCLn'i' in ' ri ATrAAAACAGGTTOTGGTTTITreTGTGTGTGTT^ 

GTAGTGTGCAAAGCCTGGCGGTGATTGTTCriC^ 

TCGGAAGTGGACTAAGAGGCCAAGTCACTTTGATTTI^^ 

TAAGCCAGGAGAITCTOXSTTCGGGGGGATTTCCTGT^ 

GATGCCAGTAGCATCTICTATTTCCTCAAGT^^ 

CAAAATCACCCCTCGATGAGAACCCCTKXri^ 

AAAGTAATGAACAAGAAAGTATTAATAAGACCCAAATAATAATATAAACT^ 

GCCCTGCrCTAAACACTTGCXIATCTAGTCACTAAT^ 

TTCXnTGAAAACGTGCCATAGACTAGATAGCTTGAAACAACAGAGAT^ 

GACATCGAGCnxmGGCAGAGTTCKSTTCCrCC^^ 

TCGTGGCTGCCATTAATCCITGACATTCGTra 

ATCTTCACTCTCTGTCTTTGCTCCCT^^ 

GACCTCATIT^ACCTTAACAAGrrc^K:ATCTGTGAAGA^^^^ 

CGGTGGCTCATGCCTGTAGTCCCAGAArrTTGGGAGGCl^ 

CTGGCCAAAATGGCAAAANNCC(XTT^K:TC^^ 

TAGCTACTTCGGAGGCTGAGGCAGAAGAATCCCTTGAACCTGGGAGGC^ 

CACTCCAGCCTGGGCAATAGAGTGAGAATCCGTCTCAAAAAAAAAC^^ 

TNGGlXXrACTOVTAAATTlCGAAGAAATGCTC^r^ 

GAGGAAATTGAGGCACAGAC^^TTCAATAACTACXS^^ 

AGTTTTTTCAGTrcCACAGCCTGAGCTCOT 

AGGGTTTCATCTGGGAACTTGTTAGATGTGCTCAC^^ 

GGGAGTAGQGCKyUXrAATCTAGTTTTAGCCCTCCAQG 

TGCCAGATAAGAAAAAAAAACCTTTCGAGACATCATTOSAATC^ 

AGCACTGTACTACTCATCTCTGCCTCCATCGGAGCG^ 

TCCCATITTACCAGTGCCXZAAAAGTCrTAGAAAAATGACMTC 

GTCAAAGATAAAGGGAGAACAGCATTTTCTTTT^^ 

ATTTATTTCACy^GAGCAAAGTCrGGGTXXa:^ 

AAATTTAACATGCIGTTAGTGCATGCAAAAAAAAGCOCi^ 

TTTCCTATTATITrACCTCAAAGCAAACACATCAT^ 

GTGCTCriTITrAGCAAGCITTTTGGATC 

TTTlTTTCTITTrAATTACAGGTTATATAAAATATT^^ 

GATOCGGACTGGCCACl<n<3AGCCCATCACrrAAA^ 

ITCATTTTATTIGAGGAGGTATGTTGTGGGCTTG^ 

TGGAAGQGGOCACTIT^CAaySCTTTCAT^^ 

CArrcAAAGCICCTTCTGTCriTr^^ 

GTTTATCTTGrrcrrTGGTTCCGGCC^^ 

AAGGAATAGAAGCTCAGGATAAACGTTTCAGACCTTTC^^ 

AAAGAGGCMOCATCATTAAAAGTTTGrEATGGATTCr^^ 

ATAGCCTTITCAAAAAATQCATGCAACTCAGACTATTO 

CTTGGAGATTATITGGTACCAGCACATGTAGCT^^ 

ATTTAATATATTCATrcCTATTTATAGACATTGAAGTTAGAat^ 

tcctgoltgmtaitrggtctgcttgagcaaggatat^^ 

actgtgtacacctitgaaacaattaacttcaca^a^ 

tccattcacacgtgagaatgcctgtggcccxrcagaaat^^ 

acaggtcccacaagtcagcaatc^«x:acqccatt^^ 

accacttcataacactgtgtagtcxyvatcxotaagc^^ 

tgtatgactttatctqcttaatacaaagtgggarrgaac^ 

tgatttagcritcaccgcggcatcttcttgtj^^ 
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CAGCCCIGTAATAAATGTCTCITOGAAGAGATGTGT^^ 

GGGGAAAAGCTTCAGGGAAGGGQCAGTGCAAATTGTTT^^ 

GGGTACAAlXXXnXSACGCCCGTTAATCCCyWXGCT^^ 

ACCAGCCTGGGCAAACAGCCTGAAGCMAAaamrAT^^ 

ATCTGTQGTCTX^GCTACTCAGGAGGATAAGGTGGGAGGAT^ 

TOnXKrCACTCCACTCXMCCTQGATGATAGAGC^ 

GTQGTGGTGGTCGTGGTAGTAGTACTAGTCAATGCC^^ 

TGACXSGCACTiGTGGGGAGCTACTCrrGAGGATGAAAT^^ 

GAACCTtXrrGCTATCTGTACATAGAAGAATGTITrAGTC^^ 

TCAGCATITCATCTOCyiTATACAGGACTGGAAAAT 

TCTCnriCCCACTGAGAGAATTITCTrAGCT^ 

ctcactgagcacctaccatgtaggixxtcatcxxxsa™^ 
aggaagcx:aatgtgagaaaacagaaa3agcaccaggcagtcgt^^ 
aaaacaacatacav'itcatcatttioctttc^^ 
agggtggtcattgactggggctggaticgggaaggatccactt^^ 

QfTcy^TTXXnXXyxriCroGGACTCAG C 

TGTAAC<nx3GCAGCrK3ACTTTTGTCa^ 

TTmAACTTAATCrraiAAGTCACATC^ 

TTAAAGTGGACAATTCATTGGCTTATCGTATAGTCACAAA 

TATCACCCCTAAAAGAAACCXrrGTACCCTTCAGCCAGCA 

GACTTTGTTICrATGACTTTACCrATITCGGGTAATT^ 

CATTTAATGTmCAAGGTTrGTtnTATITCATGTA™^ 

TACTTCATCCCCTTGTATGGCCAAATAATATrcTATTC^^ 

CATTTCCACTTTCrroGCTATTATGAATAATAC^ 

ATAAGCGTTGCCACAGGTTGCTTXirrGATTT^ 

ACTTCTATTTTCATTCCAATATCTCTTTAT^^ 

TCKy^CATC^lTTrAAATAAAAATAAAGTAGGGAAAGAAAACAGATAGT^ 

GAGGTGACATTTGAACTACACCCTGAATGAAGAGAGCCAT^^ 

CAGCAAGTGCyVAAGNGCCCTCGGGTNGGAAAOiCACT^^ 

GCCAACAAGGATAATAGACAACGCTATCAAGAAGGGGGCCCAACCACACAT^^ 

ATTTTCCXriTGACCTrATGGTGATCACTATTGGGAG^^ 

GACaXATXKnrcCCATGAGGAGGACAATOCGTAGGAGGCA^ 

agitaaagaaagcaactaaggtoaccaggtgacaataatggagatagagaaaacag^ 
actcagct^tattcaaatgtaattgtcagttttcgtc 
cacoctgactccx:agtgtgqgctggccaccx:ag 
ccacgtaggaagkkxraggttggagaaaaagcagtcacagctc^^ 

AGGCTQGATCATTACCAGCCTGTAOXrra^^ 

TAAATAAAGTCTCCCAJCriTAAAATGTGAGCAGAGTGAAGAGCTOGCAGr^^ 

'ITraZCTITTCaU3ATGAGCX:CGGTC^ 

GAAACriCTGTGTGCaiATCCATTATGTACTO^ 

ATTATTAACAGTTAAAAATAGGAAAATAGATITCAAGGGGAAAAAATTTGCAQC^^ 

GCATGTGCGTAAAACGCyATTKXCAAACATGAATAGGCAC^ 

TAATAOCCTAAACCCTGCTITCAATACAATGAACITGGGAAGC^^ 

CGGCTCXZATTCATCAGCCTGCATCCrGTTC^^ 

AGGTAAATGCGATGGGGACAAAAGATAGCAGAGCXCNGCSGGCT^ 

TAGGGAGCrreTGATGACGGGCAACAGATTTCCAAGTT^^ 

CACATCTIXXX:ACGCACATCC^AAGGGAGTGrIA^^ 

GATTCAGTAGGGATATTCATAACTGTGAATTIXa\ATATA 

TAACTATTGA G r iTiC XXrrCTCCATCro^^ 

GAATCTGGGGGCAAGAGACGTGTCnCrarrT^^ 

CCCrCTACXNACGCnGACTGGCAGTATTO:^^ 

AGAATCCTGAGCnXSACCCAAGACACACATC^^ 

GACTCnCAAGTAGCTGGGATTACAGGCATGCACCACCA 

CCATCTTQGCCACXXrrGGTCTO5AACTCCTGAC^^ 

ACGTCyUXX:XXnXX:QCCTGGCCAAGATACTGAGATCn^^ 

GGGAGGCTAAGGCTCXAGGATTGCTTGAAG(XaV3G^^^ 

AAAAATITAAAAAAATTAGCCGAGTGTCGTGGTGTQ^ 

TTGGAGCCCC3GGAGGTCAAGGCTGTATAOTAAACTGT<^ 

CTTACTCXX:ATCGGCATGCCrAGAQGGATCCX:AGT^ 

CATXXTTACAGCCTATTTITACAAAGTTTGAATArr^ 

ACAAATirm'AaTrACTCTCTAGAACXJVTTraiAT^^ 

GTGGGCCTGGAGTCACTQGAAAATGCAGACAAAACAAGAT^^ 

AGTAAA£a^GC:MGTCX:ATTAATTCAGGGGGGAlTrACAT^^ 

GTTCirrcXTOGAGAGATrAGTGATTGAAAGGAG^ 

ATGCAGCGTTCnCACGAAGGCTIXXTAAGGAGGCTC 

ACTGTTGATGTOCAGGGGGTGACAGTTCCATC^^ 

TTTATCTCTAACTCTAAAANCCXy^ACTGCTGGCn^^ 

GNATCTTTGTtn:ACTKXriTITTAAATTCAG^ 
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TGTGTTTGCTAGTAGCTTCOlTGAAACTerATXSGGl^^ 

ITITATTGAATGCSGlTAlCTCAATATATATCrrALUlT^ 
GTGTCXmKXrrcACGCClGTATTCCCAACAC^^ 
CCTGGTTAACACGGTGAAACCCTGTCriCrACTAAAAAGTACAA;^^ 
CAGCTACTCGGGAGACTGAGGCiWXyvGAATQGavTXSA^ 

catcccagccntggqcgacagagtgagacrcx^ 

aatattgatgagaac7w:ccatractctqctcattc^^ 

tttctcicgcatcagcttgcatct^^ 

atcccatacctctgtgtgtgttccttatct^^ 

cagccaaagctcttcttaaatacacta(Xx:atcaaaac^ 

agcatitoiagaggargagqcgggtggatcacttcaggtc^ 

tctctactaaaaatacaaaaaattagccagctgtcgtcxxacaa^ 

agaaccactggaaccrocxsaagcggaggtt^ 

agactttctcto^aacaaacaaacaaaaaacatacaac^ 

TKyCTCCTCTTGGGAATTXXGTAAGTACrri^^ 
TrcCAAGATTCGATTCTTTCCITKrr 

TGAGTAAGAACAAGCAGGAACAAACAATTAGAACATTTAGTGTGGCAAGGGGCGTG^^^ 

GAGAGACAGGAGTTGTTATCTGCXXXXSAACAGGTGAAGGAGTGAC^ 

CCX:TGAGAAGAGCAGGCATCAGATCXXXrrTTQGAA^ 

GGGGAATAAAGGGAAGAAAGTGCATTCTGJJ^GAGGGAATAGCAGGTAC^ 

TGTCTGAGTCrrcCTGATGAGACACGTGGTGrr^ 

GGTATGCGAACCTCTTCAltinan'rrAAAAGGGAGAGATTTAATAAAGO 

aaaagagtagttttcaggrigacctica^ 

gaagctacaactggagctaagcnagatggataagccaaaaaccaacctcx^ 

agctcwvgaagggacccx:agaaccggaagacaaaaaacttcc^^ 

agctggatatggcccttccctcacattt^ 

tgtctataaacaaagotgcaagggagtctggaaagggagtttta^ 

CXXTNAGCNAimXOTCCCXmGGGGATWn^ 

CAGGAAGAAWSTNGCTACTGTCATCTAATGGGTCAAGGNC^ 

ACCTACCACAAAGAGTTACCTACCCCCAAAATGT^^ 

GAACTAAATGAGTCAGTTTCAGATATCCAACACAAATACGT^^ 

AGCTAAAAACTAGTTACAGTrcATTGGATAATTGCCTT^ 

ATOCCACATGGCACCCCmTTGATTATTTCAGTGAC^ 

GCXSAGAAGGAAAAATAAAGTTTCTCrcCACGrrTACGTTTT^^ 

ITAATAGAACAAGTCCTNCriCAAAATGGTrcXCrrr^ 

GCTCGTTTTTCTTTTrcCTOZAGAGGCAGGAAGAGGGAAT^^ 

CCCCGGAAGCTCCATCrrTTGTTCTGAGGGC^ 

CTGGTrcCCTTTCACCAGCAGGTCCAGNCTCTA^ 

GCXACTCITCTACCCAGGGTCATATCXTKKS^ 

GGNAAACACCTTTCCAAGCIMIX^CCAG^^ 

GCCACATGCTGGATTGGTGTOTTACGTTATGAGAACT 

GAAATCATGTCACCCTCTGAATGTGGGTT^ 

Tra^ATTACAGATGGTTGCTGTACTAGATGA^ 

CATITCXSAAATAAGGCCITTTTAATTAAAAAAAAATT^^ 

CTAGTCTCAAACTGTTGGGCnGAAGCAGTI^^ 

ACCTT^CAGAAATAAGGTCTTTATAGATGTAACTGAAATAA^ 

GGCCXrrACATCCAATAAGAATGTCXZTrAGCAGA^ 

AAAGAlTGGAGTGGTCXXyVTCTACCAGCCAAOGAATGaJ^^ 

CATTTCTCCTGCAGAACCCCCAlSAAGCXyi^^ 

AGAATGAATTGCTGTTGGGCXrAGGTACAGTGGCTCATGC^^ 

ACTTCAACCCAGGAGTTCGAGACCAOCCTGGGCAACAAG^ 

GW5GTCTGGTGGGGCATGCX3mn'AGritXX:AGCTACa^^ 

aggttqccgtcagcntsagattgtocx:^^ 

aaaacataaataaatttctattgttttaagacaatttgcc^^ 

toiagct^ati™a^k3tacaacaaccxrixx:cc^^ 

ccctgatmcttacctataaagtagagattccctatctcat^^ 

taaagcatttggcatagtgcctgattcagagtaagtgacx:aata^ 

gaacataagtataomtaagaaagcaccatcagtattcaattoxx^^ 

aaagtagaaaaatagatgtcaattaaatggatotacatcttaaaaago^^ 

CATTCClTCrrcATCTCAACCCGTTAGGATTTG^ 

GCAAAAGCAGGAACATCATCTGCTTTOXCTAGTrAAGAGrr^^ 

ATTCTXSTTATCTTACTTCACAACTCl^^ 

ATTTTATXTAAATGTCAGATTTAATmAATCCATGACCAGAAf^ 

TTCCTirrcCATGTTCAAGCnXSATAACC^ 

TGAAAATACAGCTCTCAAGGCXXy^AGTCXrrT^ 

TTXSAAGACACCAGAAAATAGTGTATGGATiaTrGAT^^ 

GGTTTCroraiTCATOywSGCATGC^^ 

GGCAACy^GAAGTCATroaVTCTCCAAACTOITO?^ 

ACCATTTGCAAQGAGAATAGCTGAAAAGATACTTAAATAAAAGTGATCG^ 
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AC»TTCCCACUGGACCGTTACTC3CnCTGC^^ 

TTGAGTTKXaATCATAACAGCCATCTGACT^ 

ATGGATAGAAAAAGTAAAGCAATACATCCTGATAAATGCnxyi^ 

ATCCATAAGAGGGCAAGGCTGAG?WXCCCGGCCACCGCCGCTO 

CGTCAATGGGGTGAAGGGGGAATGGCCnGAGAAAGO^lGGA 

AAGC3TCAACCnXK:AGACATCnCCnX3AAGQAflCAC^ 

gagatcagtagaagagtacaataaaacaggaatiticagagcatgt;^ 

QGTGCAGAATAAGCTCTAGCXriTAGAGAGAGGATGCCC^ 
TAGGAAACTCTICGCCTCrCTAACACXIACG^ 
GTTGATCATCGGTAATGACACCAGOtfXIATAACACGAAACCCC^^ 
/CATCCCCAAGCXnTGCATACTTGATTCGAAAGAGAAGC^^ 

occcaltagtgttggttacgacottxtaaaac^^ 

gaagcctaaagtgcaaattcxx:aaacaagtcaagagac^^ 

tcxritaattatccatgttatttcaatacaaccctttc^^ 

aSTATCCrcaXXU^GAGGTrTCTGCXrATGCAAW 

TTOGTTCATCCCATTO3AGTTATGAGC3CnCTATr^^ 

GTTTACTAAAACCTGCAAGAGACCGCnTGAAAlTITCCO^ 

GAAATTTCGCAAACITCATCGAATTTTQCTAAG^ 

CTTICCTTTGGCTCAGAGACTTATTQn^^ 

TAAGAG^^AGATCmyyXSJU^QCACTCCCXATC^^ 

AGAAAGAGTTITCXrrCTTTTXXCCAGGCTCXSAGT^^ 

AAGCMTTGTCCTGCCTCAGCCTCCTGAGTAGCTGGAA^ 

TAGTAGAGATGGGGTTTCACXATGTTAGCXIflGGCTGGTC^^ 

CAAAGTGCTGAGATTACAGCKrrrcAGCCA^ 

ACTAGCynXXACANCTTCTCXXrCANCX:!?^^ 

ACAAAGAAAAATAAACAAGAACAAATOSAGAATCTTniCACTATGACT^^ 
TGTACTOTTNCCCATATATTTTTAAACTyVTTA^ 

agcagatctggataaagtcagtatacaatctggtct™ 

irGTTCAAGTTGTCTACCCTTATTATCXXXriCAGTG^ 

TTGTGTTAATATTATATATGTACATTACATOGTAAGTCXrrAAGTAAATGCT 

CTCriGCTGAATGTTTACATAGATGArcGGATCC^^ 

TATGTATATITAACAa3TAAGGAAAATGGGGriTICTGGAGGTGA(H^^ 

AGCKWGGTTTTAATTCAGAAGTTCTTTAAGC^^ 

AAAGATTTAAAAAACTGTTCATATCGTTCSVTCTACATAAATA^ 

ACOTTAAATATTTAATCTAAlTTATTCGTTATTTATrrAAC^ 

GTAAGCACTrACAACTAGTGACTCATTTAACCATTTCA^^ 

GCOCCACAGAGCAGTGAAAAGACTCCCGCCCTACTCCT^ 

TTTAAAAAAAATAAATTAAATTTITAAAAACATGACTITCriXXXX^ 

GAGAGGGCAAGACAGAAGGATCGCTTGAGGCCAQGAGTTrcAGACCA^ 

AAftAAATTATTAAAAGAAAATATATATATTTTITINAAATCATGACT^^ 

GAAAATTTWQGAAAATGTAAGAAGTCATAAAGAAGrTGGGAAAAAATTTAAATCCT^^ 

AATCAATCCAAGATTTTAGTGGAAAGGGAAAGCATATGATCTAGTAC^^ 

GTTTAAGAATCICAGAGTAATCCAGACGTTACTACTCCATAAGC^ 

ACTGAGAGCCCOCAACXKOTCAAATTCCaXSAGC^ 

GCTAOrATCAAGCAGAAGAAGCGGGATGClXrrCTACAGAGT^ 

TAAAGAAGCGAQGGTCTCAGACTITGTCCarmTGTr^^ 

TAGCATGAGCI7OTAACAGCCX:TCX:ATATriNA^ 

GAAArmX3GATCACTTACafrAAAAAGCT0ax:A0C^^ 

TGTTCCAACTCAAAGTCTACAGTCTAAAATCATCAT^ 

CAAGGTTTTATTAATGATtnXKTCTCAGGGGCTC^ 

GCTOTTATTGCnGCAGGGAATAAATCAGTITAAAGCAAGT^^ 

IXSAAATAAATTGTnMGTGCTTTGCTCATICr^^ 

TATTTTCTTAAAGGrrcCAGTATTCTTCCATTTra^ 

AGTCAAAGGGGAAGAAAAACAGTITTGAATAAAAACGGAGTTrCC^^ 

GAACGTAAGCAAATTTAAAAAAGCAACAAAAGCAGTTITrATGAAT 

CKnTITKXyiTOIIAAATTCTCOT 

CTmrCCTOGGCAGGTOCATGAGGGGGTOXIAGGCAAT^^ 

AAGGGTCAAGCAGCTGTTTGACAAGATTCTTGCAA^ 

GGTGAAAACTCAATOTIGAGATCACATACATCCT^^ 

AACCTTGCAATGGCTTTGGCCACCGTTTT^^ 

ACACAGGTCAAACATCXTCAGCCTOTTCTGTGGaCAGr^^ 

CCTTrCAGACATQCy^ACmXAAAACTCXATGT^^ 

TGGGGCATGCATCJroGTGACTTTTTCCATC^^ 

GGGCACTGCCTOGCACATAGGATATGTCCAGAAAACAOT 

TCAGTGACATCACAllXn'ATmXriTrATGGn 

CCTCCAAATTAATGTTATITCTAAAAAGTAGCCCGTra^ 

GACATCCTTAGCCAClXnTTAAAATAGCAAATTTC^^ 

CTCCTTAGCAGACATCATriClCTGTrTAAT^^ 

TGAATGAAGGCTTAAGGTITIGCCKXXnTGGlVl 

TTACATGCTTAATGAAAAGTGAGTGAATGGGCTGGGTOTGGT^^ 
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ACAGGAGGATCXXrrTGAGCCX:AGGAGGTCX3AGGCl^ 

AGCGAGANCCrTCTGTCTAAAAAOTATTrATCTA^^ 

AATCTTTATAGAAACTCAATTCAACAAACATIT^GCAC^^ 

AAAGCTGCXX^AAGGOSGAGATCCAGTGCTCAGGAAGCTC^ 

AGGCATAGGTACAGATCIGTAAAATATGAAAACAGGAAAACACATGGAG^ 

GATTGGGTCTGAGATAAACCTAAAAGAGGGTCAAGATr^^ 

TITTTCAGATCTCTTCTTGTGCGTOGCCTC^^ 

ATAAAGAGTCCrrGTCXnTCCOOTTTACTCT^ 

CaX:ACACTTTOnx:AGAGTGGGCTTQGCATA^ 

TTGOCCCCAGTQGAGAGACAGGAAAGCAGGGAAGGGQGaATTG^ 

GGTCMGCTGAGGCIXSGTCXAGTTACTCTCTCX^^ 

CCCCXACTKyvCCTCXriTOSTGAAA^ 

GCGTTCAGCTAGAGGTAGAAAGATGGCTrcAGGTGGCTGGGT^^ 
CAAGGCXSCKnXXSATGACCTCAGGTCAQGA^ 
CAAAAATTAGCCAGGCATGGTGGTGTCXXXXriOT 
GGGGQGCAGAGGTTGCAGTCAGCCAAAATIXXZACC^^ 

ATAAGTAAATAAATAAAAAATAAAIATAATACAGACACAGGCrrriCXCC^^ 
AACTtftfXriXXrrATTCCTGTTTCXrr]^^ 

taagtogaaaatcx:acatttaataacatgagaataatgtcc^ 
actcttqctgcctgcagaaagagacactcag 

GTCGGCTCTGAGCAGCAGCTTCOZAGACACrrc 
CAGAGCAGACCTGGCTT*AGGA)rcCCAGCanACCA^ 
CAAGCCICACmriCTCATCTGTAAAGTAG^ 
AAAAGGTATCTAGGCTGGGCGTGGCGGCICACGCCTGTAAT^ 

GCATCCATCrcCAGTCCTCXXn^'ACTCAG^ 

CGAGATAAAAAGACCTATCTCTAOW^TTAGCTGGGCATQGTGTC^ 
ACXnOoGAGGATCGCTTGAGCCCCGGGAGGTAGAGGCATCAT^ 

ntctcaaaggaaangaaaacaaacnaaacaanaanagc^ 

agao\gtggacacngtggggccatttcaaqctaaatta^ 

attancaagtgcgaaacatccggacacatctagtggcixxc^^ 

gtagagtotoccatkkiacagtttgggtatg^ 

ttagttttattggccaggcatggtggctcatgcctgtaat^ 

cctaggagtttgagaccagcctcggtaacatagtg^^ 

gtggctcacgcctgtaatcccagcactttggg 

ggctaacatcgtgaaacccxatcrrgtaatraaaatgcaaaagatag 

actttggagqctgaggcaggagaatcacttgaacccgw 

gca;w:agagcaagactgtctcataaataaat7wuvtaaataa^ 

acttgngaggctnaggtgggaggatggctt^^ 

AGTCTC^TCaCAGAGCNAGACTCTGTCrcAAAAAACC^^ 

ATAAAGATAACATCCTAGACACTTAAATaWSAQOCTTT^ 

TTCCTCaVTATCCAAAAAAAAAAAAAAGGAAAAAGGGAAGAACX:AAAGGGGC 

TAGGGACAGACCXAGGTTTGAGAGTTCCnXKrnx^ 

CAGCCCAGCTCTICGTTAATTACCK3AAGTGAATQGGACAGT^^ 

TCXXySTACTATCTATTrGGCCTTGGAAAGTTACT^^ 

TCX3AGAGCTGGTTAGAAAAACACAAGTCrrCGGGCCXXA 

QGAGTCTGCTTTTCTAGCAAGCXSCXXAGArcTC^^ 

AAGTCTTTTCATTTGTAAAGAGTIXmxnCCATC^^ 

GCAGCTGGTGACATTGTAGACACTCAGCAAATGGCCTTTGGTC 

CTACAAATCAGGAraXOTACAATCGCTTGATATTTGAAAAT^^ 

AACAATCTTCTATrrAAACTAGTAAACTTAATAGTTTA 

ATGGCTCACACCTITCAGCICAGTTTTTT^ 

CAATGAATTATTTTTCCTTTAQrcGGGTCTGGTOSTO^^ 

GGATACCACGAQCCCAGGAGTTGGAGGCTGCAACGACTrAC^ 

ACCCTGTCTCTCAAGATACAAGAAGQCCACATGTG^^ 

AAGATTGCTTGAGAAGTTTGAGATTGGCCTGGGCAATATAGCA 

AAATTTTAAAGAAAQGTATTTCTCCCCTTCACGGTA^ 

GC5GACAAANGGANTAATGCCTTNNAATGCCMOT^ 

TAGGTGTTTTGCAAAAACATACTITCGTGACTAAACATCAT^ 

TCTGTCAGGATAACACTTCTAAGATGTGAGAT^^ 

TAGGAAGGATCGTCATITAGRACCTGGGTTCn^^ 

GATTICCAGTTaCCAAAAGAGCAGGTAAQGAAAACACTGAGTGTT 

ATTTTTAAAAAATAGAGACAAGGACXnXSGGCGCGGTGC^^ 

GGATTCXrerCAGCTCAGGAGGGCGGGTGGArrcCC^^ 

CATCTCTACTAAAAACATTAAAAAAAAATTTAGGanGGT^^ 

ACTAGTATCACTTGAACCGGGGAGGTXKSAGGTTGCAGTG^ 

GGAGACICTATCTCAAAAAAAAATAATAAAAATAATAAAAAAAAAATT^^ 

TGCCTAGGGTGGTCTIGAACTCCTGGGTTCACGTC^ 

GCKXXZACGCCTCACCTCTIGT^^ 

TATCCAACAGTCGTGATTTAGGGCATGAAGAGCTCA^ 
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GGCGTCTITCGTCACAAGGAACAGAGTTITACTTGTGC 

CTGACTAGCTOATTCIXXnXXrn^^ 

CTO^TGACCGTGCAATTCAACTCAAOTCAT^^ 

CCC:AATTCXXrrANGaX:AGAAATCCACTGGCAT^ 

TTTGCCCTIXXXrrcATGTATGATTGGACClXXXKT^^ 

ATACAAAACCXACXrrAAGTAGAAGGACCTAlTSAACCATGAGGGTT^^ 

TGGTITICCAGTTTAGGGCCAAGroAGTOS^ 

ACCXTACCCATTTCXrAGTrCCCTCAG^ 

CICACAGCTGOUWZAraxnGGGGTOCSAG 

AAAGGACICTGCCCTCATXMATGGGGTTGGGGGACC^ 

TCTACTTITrACyKXXSACGCATCATACT^^ 

TATGTTICAGTGAAGCACrrTATAGCTrCAGAT^^^ 

CaGftQCATAGGAACACTCATAGTATGTATCAGAAACTAGTGAACT^ 

CgiayXXrrCTTIXX}GGAGTGTTCTC^ 

TCTCTGTGCTGITCNACTGTAACATCGTCC^^ 

CAGOGTGATGTCTACn^CAGAGTTTCCATTCT^^ 

CCAOCCTCAGCCAGGGCAGGAGTATTCXXrrAATGGTAGA^ 

ATCAGGCXnOKXSACGTCCCTTGGGAAGCTATTCI^^ 

AATAITCAGAGTCTGACACAATAAGGCCATAAAGAGCTAAGTCTGT^ 

ATAGICACTTTTGTTATAAATCTTCACCTAGCATQGGAG 

AAAATGCAAAAATAGCCCXrCAAAATGAAAATrAATGTTGAGAATCT 

TCTCTATATTTOICATGGCCATATATGTTTTCCTTGI^^ 

CTTAGCACAGAACTNCCTCTCCTCaTItnTTGA^ 

ATCTTCGClTCTGTGNTOTCrrcCCATGTXna^ 

TATTGTTGAGGGAAAGATCTGAGAGCAGQGCCAAGAAAOGGAAAGAGAGAGAGAGGAA 

AAAAGAG^^AGG^^^GGGTAGGGAGGGAGGNAGAGACATGGACC<:7^^ 

AAATCGAAATAAAACAAAAGGGCAGGAAACXAAGAAATAAAGAGAAT^^ 

TGGGATCTATAGATTGGGGAAAGACATTCATACNAGTCNAGGACC^^ 

TCCTTCmTmiGCTGTANGGAAmnWl^^ 

AAATTrcrr^m•AGNAACGNCCAGGCG^ICGG10GCTCAGGCC^^^ 

ACGAGGTCAGGAGATCXAGACCATCCbrrcGCTA^ 

GGAfiTTOX:AGTTAGC:7lGAGATGGTGCCACrGTA^^ 

AGAAATCTCTTAGAAATGCATATTTTCTXjTTTT^ 

CTATTTGTGTTTTTGGAAGAACTCXrrGlTITAATI^^ 

TTTTTCCGGGAATCCAGITCTTTTAGGAAAGGCCC^ 

GCCTTCTGTAGCTATCTAACAGAGACAGAGTTCACTrGC^^ 

AAACTCyOXAGGAAGGa^TGriCTTCTT^ 

CAGTTTCX:ATATCCAAGCAGCrAAGGCTTGTGC^^ 

TCTTTCCCCAGTOGGCACAICATGGCCTU^^ 

TAAAACACAlTnTiTn^CCAATGAiCy^GTACTGATGACATAAAAGGTTATA 

gtttcatatttaa^ctgttcaacacagttctc^ 

taltggaaatatgaacccaatatgttcttaacatctctatgta^ 

attttagatraaa<k3attctacctatcgaaagaagagaatcga^ 

ttgttgaaggattigttctatok:atgctga^ 

tttatgatgcagcttctcagggcgtcctaato?^^ 

actttatgaggttgggtaatgaagagagagagacccx:^^ 

AACCCTTCXnrAGCACAGCCICTATTACACAGTflC^^ 

TTAGGCATTATCnTATTCTAAATTAACACTGTAlTTCATAT^^ 

GGGCTTATTATITAGTAATCCCTGAAGCTCICAAGTAGGCGT^^ 

AGAAACGCATtnTAAATGGCOy^CTCCClTIOT^^ 

ATCTAGTGrarrACAAATGAACTTGCTACTTCGTITGCT^^ 

TAATTTTTGAAATOTXirTrAGAATGTCACAGC^ 

TTAAAGTTTTCTGCAATAGAAATGCATTTCAAATATAAGCTAG 

TGACAAGATGTOCAAATGTTTTGAAACGCAGTGAT^ 

CTTTATCTTTAGATCTTAATCAGAAATAGGAAAATm^^ 

TGAGTTACTTTAAAAAAAAAAAAAAAGATCCTGACAAATCC^^ 

CTTTCn'AITrTICCTCTCTCTATTG^ 

TTACACCCCAGGACAGGTCCCCTCCTCAGCCK^^ 

AGCCACTGCTTCTCTACCGGCGCJ^TCCnXXX^ 

TGGGGAAGGGGCAAGGGATTGAGGGAGAGTGCICCTGGCATC^^ 

CCACCCAAATCTIO^TCriTGAGTTGTJ^ 

ATGTCTCXIAGTTWaXCCATGATGTTTTC^ 

TTTCCTGAGACCTCCACAGCCATGCXrA^ 

GTCTTTATTAGTCQCATGAGAACAGACTAATACAGTCAGG^ 

TCTACTTCTACXXXACCAACAAAGAATGAITCAGCC^^ 

TACGTCACCATTGGAAGGTTGCCAAACTGTGCTTTGCA 

GTCCTGCTCATCXritriCTAAACACCCCAGAC^ 
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AAClTCTGTCrnCCCTrTTCCCICT^^ 

TAGACCITCACCTKXrrcCTCTCT^^ 

GAGAATCCCTTCACCXOTCATCTCTGATO^XTTTG^^ 

crxCTAcrOTPtyixxsTTCcmrc^^ 

AATCTCCAGCTOGCITTATTATATTTCGAAGTGAGC^ 
AATAATCCTGCATAAAGTCTTCCTTACCATTITAACAAATGT^^ 

CCICAAATGATTCTCCTACCrCAC^^ 

ATTmNAGTAGAGATGCGGTTTCACCATGTT^^ 

GGCTCXX:AAAGTGCTX3GGATrACAGGTGTGAGCCACCG^ 

GCCAACTTirrGAGGACXATGTCnTCTGTAGGC^^ 

GGCCnXSGTGCXrrcATCKXrrGTAACCCCAGC^^ 

TGCCCTGGGCAGCACAGTGAGACCrCGTATCTACAAAAAATAAACAAAC^ 

CCTAGCTACTCAGGAAGCTGAGATCX3GAQGATTGCTTGGGC 

lOTACTCXrAGCCTGGGCGACAGAATGAGACCCTGl^^ 

TGAAAGCATTIGGTGGAAATAGGGCACACACAT^ 

CTTTCCXriTCTCCCnCCATCrGGGTGCTI^^ 

TGAATTATTTAAGCACTGCCAGTCyVTTCTATATAAACXGTTCAGTAA^ 

CAGCAGOCTACCCAGGATGGCTTGATGTTGTCTC^ 

CGAAGCCTTTGTGTTGAGGTTTCTGTAGTGTra 

TAGACOCXCTTTGATAATCTCCACTAAQCAGACATACTO 

TAATCWrnCirrTTGAAAGTAAGAGTGCTTTC 

TAITICCCCX^AGTGTITCTGCGATAAGAGAATAGAATGTAGGTCCCA^ 

GTTTGTGAATnXXAGGrrrCAGaCTGCCiaST 

TTGTAAAAGATTTACAATTATTTCATTTTCAACATAGCTI^^ 

TGGarrACAGGGTGCTTTAGAAAACATCTGQCiaSGC^ 

GTAGGCrGAAGTGGGAGGATGGTTAGAGCCTAGGAGTICGAGAC^^ 

AAAATAAAAAAAATTATCTGGGTATAGTGGTC3TGCACCTGAA£?n^ 

TGAGCCTAGGAGGTTGAGAGTGCAGTGAGCXIXn^?^^ 

TXrCAAAAGGAAAAAAATAATAAAGCACTGICICTCTCTACCCTT^^ 

CAGTTTATAQGTGAGTGATATGGTTTCGATOn^ 

TGTTGTC3GGAGGGACCCAGGGGGAGGTAATIGAATCAa<X3G^ 

GTCTCATGAGATCrrGATGGGTGTATCA(3GAC?I^^ 

AGTGOnTTTGCCTACCACCATGTTTCTGAAGCCI^^ 

CCCAGTCrcAGGTATGTCTTTATCAGCAGCGnrc^^ 

TGGTTGTACTICAACCTGGAACTGrcTAACTCTrAAGGTTTO 

TG(5AAGA£3AAGCACCAACTTITCTTTTCCAATT^^ 

TAGCTTCXrTTAOCAGTTCCCiWAGCTTCAGTC^ 

CTCCXZATTTACAAGGAArrACAGAAGAGTGATXXXXXXrnCAAAG^ 

TGCXrrXXrTTTGQGTTCTAAGACGTCCAAGGAGCTCTTAT^ 

TAAGGGCXrAraTQCCTGAATCTCTCACCCCCATGTTrATaA^ 

GTCACCAGTCGAGTOrCATAAATAGGlTCTTTATGATCCC^^ 

CTGGGGTATGAGTGGCCCCAGGTTTCCCTGITGQGGGTACTQATA^ 

TATTITCTTAAAGAGACAACTGTTTTTGGTGAATAGC^^ 

GAAGATATCGTTItXIATGGTGTrrTAAAGCTAAAACTTC^^ 

GNCTA1NGGGGT^XATNNCCACGGAAGAG^^TINGAAAGC^^ 

TTGCXIATCCAGAGCTTGCCAGCAACCTrTTAGAT^^ 

TTTGTTXXAGACAATGTATATTTAAAAAGCAATTATTTCXCCA^^ 

TTOn<3GCrrGGGCATIXnTrATCTCCACC^ 

CXXnCCCCACCACrrACCCACAGGACCCTCTGCATAT^^ 

CCTTTATTACTTCTAAGTTCCAITXXIAAAAATATCCTE^^ 

TTCCAGCTTXrrATTTTTITCCACATAATCCCTAAC^ 

ATACAOVGAACCAAATAGTXIATAGAGTAGTATTTCAGGTCX^^ 

CXSGGAACATTTGGCAATGTCTGGAGACArrGTTGGTTGT^^ 

AGGTCTGCTACTAATaxrCCICTATTGCACAG^ 

CAAGGCTGAGAATn?rTGATCrrAGGGGAAGATGTC3AT^^ 

CXnxnCTTTTraiACTKMGTTGTGGT^ 

AGATCyrAAAAAaSATTTAGACGTCTGGACGTGGTGGCTC^ 

GATCGGTTCAGGTCAGAGTICAAGACrAGCC^^ 

CTAGGCGTGGTANGTCmxnGCATATAATOCaw«X^ 

AGATCGCACTACTTCATTCCANGCXrTGGGCAACAG^^ 

GAAATOH'ACAGTCAGAOGTrAGGGGTCaSGTCAAAGCGATC^ 

GCTGOSACCATGGTCmTVCTGCT^ 

CCTGAAACAAhTTCCCXriTCCCATrGTCAAGGC^^ 

CTTTCCATTGCTTTNGCTCX^GTATCXmT^^ 

GCAGCCmX3CATTroGram3NrATATOJGAACNA^ 

CCANGCAAANGGCAritCTGCAimJQCCCATGTN^ 

ACAGAAGCATATGGCAGAATrcATATGTGAACAGATKXOTTTT^^ 

CAGCTmTGJUtfSTTACTCTICeroCT^ 

TGCATTCTGICTAmXSTATCnTTACATCTGC^^ 
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TIGTAAACACAGGGAATAGATGAAGCAAGCTCXnTCTr^^ 

ACaXXCrGCTACTTTCATTrACCATGTTACT^^ 

ATA^myACATTT^5G^m3TC^IAAAGTACTGT^AAGGTAT^^ 

TTATTATTATTATrATTGTGTTTACATCCACAGGCAATCACA 

TTITTCaCTCACTCCAATTTCJWVI^^ 

TCTTTCATGATAGAAGTAGGTCATGQQGGCGGTT 

ATGCnCIXXXMGGAGAGTTCAGCGC3QCTGGATCAGAGAG^ 

AGCCAAGGTCACAGCIXXSAAATAAAACnnOTATira 

TCAAGTTCTCAGCCAAGAGAACCAAGAGCCAGAavCAGGGA^ 

AGCATTGACXAGGTGCtXAATAirrTAATGTCCCATGGGTGT^ 

AAAQGGOCTGGACTGAATTAGACCAGGGGTICKyi^^ 

GTTAGAGATGCAGATCTTXXXX:CCCGCCCCA£^^ 

TGTAGGCTGCAGTCTGTGTTAAGCTTICrcGGT^ 

ACAlAGGAGAGAAGCCACTAGGGGTGGGAATCAGAAATACGGriGTCCC^^ 

GTCnrmAACTCTCTCTTGCrKSGC;^^ 

CGGGATirnCTACAGCTICATATTCSGCTrTACCA^ 

tcxtaagtgacttcagactctatagacagtocctg^ 

gaaaaccatacacatcxagtagaaaccgaactitaca 

tcgtaggatactcictcacaatgttaggcca^^ 

ticwiaccaggcacggtgltxxctgataacc^^ 

titaaatttccacctcttttggggggggttow^^ 

ctttggtocacccatcactcgagcagtat;^^ 

CXXTCnXIiAGriCCCCAAAGTCCArrOTA^ 

CACACyWJIXXriTGGTrTTCCATlCCn^ 

CCATCAGTTCATrcCTTTTCATGGCnxiAGTAGTAT^^ 

GATGGCCAriTGGGCTAATTOACATTTrTGCAAITG 

ATATAAGQUnTCTTTTCCTCTGGGAAGATACOCATr 

CAAGAACTCTCXaw:j^TGTriTCCATAGTGGTTG^^ 

^KXGCATTCACaXAATATCTATTArlTmtX:ATTTO 

GCAACAAAAACAAATAGGTCGGACTTAATTAAACTAATCAGCTa^ 

ACAACCXXSCAGAGTGGGAGAAAATCTATACAAATCTATA 

AAATTAGAAAGAAAAAA(XrrAACAATCCCATCAAAAAGTG<XXrrAAGG^ 

CAAATGACCACCATACGTGAAAAAATCXrrCAA 
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Figure 13 

GenBank accession number AF217492 (SEQ ID No 52) 

tcttagttgacatgtcattatggattcttgggcataaacgtttatatgaattttgaatattaggaaataatcttggaagc 

tatattagttttctaaagctattataacaaattaccacagatggagtgcctcaaaacaacagaattttactctcttacag 

tttcagagaccaaacatctgaaatcaagtggtttgcagcgttggctccttctggaggctggaaggagcatctgttccatg 

cctttctcctagcttcccatgactgccagccacccttggaatttcttggcttttatcttagtcagtctaatctctgcctc 

tgtctttccatctccttctctgtgggtgttccttcCccccttctgttttgtaaggataggcctgtcattggatttaaggc 

ccaccttaatccaagatgatctcattttaagatgtgggacttaatcacatttgcgaagaccctttttccaaataaggtcg 

cattcacagatccctaccattcaactcactatggaagcaatgggatgggaggtaccattcaactcacagattcctaccat 

tcaacccac tataaagcagtggca ta tgaaaaggatgggagg ttgc tatgggaa tggtctgccccaga tgcaggtaaaat 

ggggaagca 1 1 1 tata t agaa tggaaaaacaacag taaagc tggc taaaac t cga tc tgc 1 1 1 1 tacag tea tg tgc tga 

caattctaaacattatcaatgataaaatacttctcccccatgatattaactgcgcaacttttgctgcccctgccacatac 

atgctttattgcttggaaacaggactacttggtgaaataatgtgaatctttcaattctcatgatacatgttttcaactta 

atatccagggtgcaggaatatatattcccaccaactttatacaagataatacaaatataatagtatgaataaatatgcta 

atttgatagaaaatattttaatattcactactagtgtagttggacgtgttaaatattgcaatggttatttgtaatctacc 

tttataaaatctctaatgtctgccactgtttctgttagactcttaagaagatacgtgtttgtctcatagatttgtaagaa 

ctctttatatactaaaaatatggggcttctatatgattgcatatattcgtcacttttgttccactgttttagagagcaga 

agattttaatttctataaagatttttatgtttccctttattgcatgccaatgtttgttttttaaaaacttgtacgtctta 

ggctgcgatgtgaaaagtgtcttttttttgatttggaaacagagtcttgctctgatgcctgtagtggtatgatcatagct 

cattgcagcccttgaactcctgggctcaagtaatcctcctatctcagcctccgaagtagctgggactacagccacacacc 

accacacctggctcactttcgggtggttgttgttgagatggggtcttgctgtgttgctcaggctggttttgaatactggg 

ctcaagtgattatcctgccttagcctcccaaagtgctggaatttcagaggtgagccactgtgcccagcctcctgtttttt 

1 1 aca tgta ta tatatatata tat a tatatata tata tatatatatatatatatacacacacacacacaca tata tata t 

acacacacacatatatatatacacacacatatatatacacatatatacacatatatatacacacatatatatacacacac 

acatatatatacacacacatatatatacacacacatatatatatatacatatacacacacacacacacacacacacacac 

acacacacacatatattttttttttcttttagatggagtgtctctctgttgcccaggctggggtgcagtgacatgatctc 

tgctcactacaacctctgcctcctgggttcaagtgattctcctgccttagcctcccaagtagctgggattacaggcgtgt 

gccaccacgctgggctaacttttgtatttttagtagagatgggggtttcaccatgttggtcaggctggtcttgaacttct 

gacctcgtgatttgcctgcctcagcctcccaaagtgctgagattacaggcgtgagccgccatgcctgcccttatttttat 

atatcttccataacctttctatctctttctttctctttctctgtttctgtgtgtgtctgtgtctgataataaaatacatt 

aaattttattttataacataaatgttttataaggtcggtagatttgtttatgattttaatggctgcatagtattctatca 

tatggatatattctaaatgtaataattcctcaatggtatatttagtttactttatatttgctgctttagtttgtcacaaa 

cactgtggatagccttgtatatatttattttgttgtatagttcttattggtacagtatgatttttcacaagtatacaatt 

actttctgagaatttagctttttgattcacgtgaaagccttacttgtattgttaacaattttgttacgtttgtgaggcct 

aggcagaaggattgc t tgagaccaggaac tcaagaccaccttgaccacc t tgggcaacatagggggagcccatc tc taca 

aaaa taaaaaa taagaaatciaaa tagcca tgcatggtggtgtgcccc tgtagtcccag t tac tcgggaggc taagg tgga 

aggatcgtctgagcctgggagtttgagactgcagtgagttatgaccacaccactgcactccagcctggatgacagagaaa 

ga t c t eg t caaaciaaaaaaaaaaaa 1 1 aaa ta t a tacaaaa taagggagaa t ac t a tgacaggc t tccac ttacctatct 

ttaggcttctttataaaatgttcatttaatctgatctctataagaaattctgtatccttaaacaacatgtgcccttctct 

ttacacagagcgattacccccttgctatgttttattaacagctagcaagtgctgggtcctgtgtcagttgttttgttttt 

tctttttctttgtttttcttttcttttcttttcttttttttttttgttttttttttttttggagatagagtcttgctctg 

ttggtcaggctggagtgcagtggcatgatcttagctcactgcaacctctgcctcctgggttcaagtgattctccggcctc 

agcctcctgagtaactgggattacagacatataccaccacacctggctaattttttttgtgttttgacagagatggggtt 

tcaccatgttggtcaggctggtctcaaactcctgacctaaaatggtccacctgccttggcctcccaaagtgctgggatta 

caggcgtgagcccccgcgcctggccctgtgtcagcctttttaacagcataatttttcctttattagtcataactacttta 

agggc tat 1 1 tga 1 1 tacagacaaag tgagac t tggatgaa t taagaaagaat taatcagg tggt t gaacc tgagt tcca 

actagggtgacaactcttcgcagattgcctgccgctgtcctggttttcacactgaaagttccgtatcctaataagcctct 

ca tec tcagcaacggaggacagttgcccac tcaaagcc t tgtgacattctaggg tatcc tat t tc taca tggtgggaatc 

tagaagacggagaaagaatttctcattcccgaaggageatggattatccttggttgtagtgtttatgttccacatcacgt 

ggattcccgaaggagcatggattatccttggttgtagtggttatgtccacatcacatgggatattttatttttcaggcet 

c t tea tg tgc t tg tgtgcaaegcagcaae t tttgc tc taeectggagtcteaecaaaga tggcc t ggagaccacc 1 1 tea 

agtgaatcatctggggcacttctaccttgtccagctcctccaggatgttttgtgccgctcagctcctgcccgtgtcattg 

tggtctcctcagagteccatcggtgggtttgaattgcatatttgttcacttatcccctttctcataccagctaatattcc 

eccaaggc tc teat tc tgaaaataattt tea t tag tec tgc ttgagaca tgtgggtggac tcagc t tggc tcacttaa 1 1 

ttteeaggtettttttgttcgcctgcgattgtgggggactgtttagaaggactttctagagcaaggaagattgcctttac 

gacta tac ttcaage tec teat tga ttttcgct tacagatggaataataact tea tgaaaaac tcaatggcatgaacc ta 

ttat tgga t ttgtaa t tcaacaact tcaacate t taccaagaagaatgtgcagt tat tctageaggagaaacciatgcaat 

tagagcctgegagatgaaatcaaattgttttataatgagaaattagggaattcgaggcagacattagctgtgtaattgtg 

gaaagggaagaac tgtagttagagca tat tagaaa tc tggccgtgcctct 1 1 tggt taaaa 1 1 tcaat taaaacatcag 
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Figure 14 



Genbank accession number AF2 17491 (SEQ ID No 51) 



gcttttgaaattaagatattgtccattgcatctttccatgatcgaggcgttgtcgtgcattcccctccctacagcgtttt 

aggacgcagat tgagaggaga taacg 1 1 agcccgaagcgcc 1 1 tgcagcc tc 1 1 1 cgcgcggc cgaaacacagcacaacc 

cacccccccgccccaccccccagcctgtaggtttagcagaaccccagcctcacatcctccccgaacttggcagtaaaagc 

cctgttcttccattcaccccgatgatttatattctctctgggcgtcttatattaaacaggggciattccgacatgttccat 

aacacaCCtactgtaactCgataccatgaactacacttgctgttatttatcatttctttttattttctctcattgcagca 

taaagccaaggtagaaacaatgaccctggacctcgctctgctccgtagcgtgcagcattttgctgaagcattcaaggcca 

agaatgtgtgagtgttccagtggagggttatagatcataatttcttgctattgtaatatctttatcagatgaacacaatt 

gggagaatgcaaggctgttgtgttgtcttggcgtccaaacaggaggctcatttatattggccctgttaaggtgaaccgta 

1 1 1 tc t tgactcacagtcacc t tea t ta tgaga tg tgtca tcaa tc taa taacagc t tcccaca taccaaaagagaagac 

actattaaagcactagtaaaagtggctaataaaagcttggcaatagtaagatgcatcctgattataagattttttgtagt 

gcatttcagaatggagtaagagtatatttaaattgcattcaggaacaagtaaactcaattatccaatatggcagggaggt 

tgacaatccaagcacccaaaaaacctctagtttctaaagccttcgatgatttgatgtggtacatggatgtggttccaaaa 

aacatggactcacattccttttatttattttttttcatccttttcagtctttcaaaattccagttggagaaagccttagc 

tagggcctagcatatt t tgatcc t a tea ta tgc tagcatcce 1 1 tc taacagagaagg t tgtaggagaaagggagagaag 

cggaagggggtggggagacagagagacagacaggaggcc tcaaaccc t gaaaeac tgagc C aaggaaag t ga tea t ggca 

agctacactaattacaatactttgtttecaagcgtctatttttactcacatttagggcaggcaatcctggtttctcgttg 

aacatagaggtttgaatttcattaataaataacttcatttactttttttcagtgacttgattcaaacatgaggattaagt 

taa taa tagcacaggt tgtgcgaagga taaga taa t tacacaagaggcaccagaaecac tgaa tg tggagagc tc tea ta 

aatgacaagctgcccttgggttaggctctgttgggaacattagttctgcagtgttgrcaagcagatgaaggatgtgaggga 

aggga t c t taaaccagatattcaaa tggece tg tggggagc tgaeaeeacac tgc tg tc tagtgtccaa t te tec t tgca 

tggctgtgtcacccaggttggaacgtagtgcacaatctcgccttaatgcaacgtcccaetgcgggcccaagtgattttcc 

tgtcccagcctcctgagtagetgggaetaeagatgcctgccaccatggcctgctaatttttgtatttttagtagacatag 

ggtttcaccatattggteagactggtctcaaactcctggcctcaggtgatccacccgcctcagcetctgaaaatgctggg 

attaeaeacatgaaecactgcgcccagecgctctaetttattagatttaaaaagtttgctctcagctgggtgcagtggct 

tatgcctgtaatcccagcagtttgggaggecaaggcggggagggtcatgaggtcaagagatcaagaccatcctggceaac 

at tg tgaaacceca tc t c tac taaaga tacaaaaaat tagc tgggtg tag tgg tgeacacc tc tag tec t age tae ttgg 

gaggctgaggcaggaaaatcgcttgaaectgggaggcagaggttgcagtgagccaagatggcaecaetgcactecagcct 

ggcgacaga t tgagac t ccg tc tcaaaaa a a n na a aaaaaaaa a aa n a ag 1 1 tg tac tcagcc tggcg tggtggc tcacg 

cc tgtaa tcccagcac 1 1 tgtgaggcegagg tgggcggatcacetgaag tcaggagt tcgagaccagcc tgggcaacaag 

gtaaaaecccgtetctactaaaaatgcaaaattaactgggcggggaggcacatgcctgaaatcccaactacttgggagge 

tgaggcaggagaatggcttgeiaoccgggaggtggaggttgcagtgagctgagactgaaccagtgcactccagcccgggtg 

acaagagcaciaac te tg tc tcaaaaaaaaaaaaaaaaaat tgac tcaggct e t tgc tggagta Cgtcag tg teeeaag tc 

at t gaggtc taca t tagag tcagtc t tag tgagagt cgeaacatcgcctaagcgcagaccccaagc tggc t tg taggaac 

agtgagataaeatcccccagccagccagatctgagagagccctcctgtgtgtgtcctttgeagcgcttggctgttagtag 

ct tccaccct t tgccagaactatcaagaggcacc tcaacaggctgcaacac tagcattcaagacatttgttcgc tgat 1 1 

gcttcgtcattcatttatccaccggttccttcactteacacacattteaggageaectetaaagccceagaeattcttct 

gggctctgagaagagagtggtggccctggaagecaeagtaaecgcacacaagetgteacctcagttgtcagetattatgg 

atggc tgac tec tcaaae tea tetgte tgtggttgtttgggagggaatgataatgacaagaacagcagcggag tcagcag 

ctcatttactgagggcttatgggccatacgtggaaaatattttgcatgtacgttttgatteaattcteaaageagtectt 

gggagtagatgctattaatatctctactatacagataaggaaagtgaggctagagaggatatataatttatccaaggcat 

gtggaatgtgggtagaggggccgatttagcetettcacggctccaacttgaeeaccg tae ta tae tgtgtt teat tt tag 

tttattttttatttattttttgagacagggtctcgetgtgttgcccaggctgaagtgcagtggtgcaatctctgttcatt 

geaaec tc tggcc t tgggt tcaagcgat tc tea tgee tea tec tccagagtagc tggga t tacagg tgcaggt taccaca 

caeagctaatttttgtattttctgtagagatgaagtttcaccatgttggccaggctagtcaactcctgacctcaagtgat 

ctcteeaeettggeetcccaaactgctggaattacaggcgtgagccaccccgcccgaceetttcatcttaaatgtgatag 

aaacagactcagagaatgtgagtgacttttcgaagggtcactagcaagttggtgacgggcatggggtttcactccagtet 

gcctcttgttcttttcctcatagcatgtgagatctcacaattcactgccgctgaataactttgtccagttcctetgtcag 

aa tgtgt tggaag tgagcagct t tgc tgaagagagccaac tgt tacataataggga tea tc t tceacg tc ttaac tggac 

cattagtttttaaceattaagtagttgggaaateatcaagttcaagtgctttttgtaateaecaagctcaaaactttgga 

ggaaactctcaeecattttgccctaatcccagattttagcacaggactttggaattttagaaagtacttetaggttgaga 

aacaagggtga ttt tcea t a tagagacc tc tg tagcacaaccc tc 1 1 tgtatg tg tccca tgagag tacaggat 1 1 taca 

tggc ttagagaaat ae tgtgg tgg tgaaaata tgca taaaaaagcgagagattt tac tt t tgaaa tgc tgccaeiagtgaa 

atctcttttggataggcagatgacagtttagtaettgcaggtetttegattctgacaactgtcagaagtgaatttaagtg 

ctgg tgac taggc tgccagagaaacaca tgtaae tc ta t tc t tatcacccccca te t tcagacat taaaat 1 1 tet tg tg 

tgtaaaacataatgacaaatgeiaagaaagge tgecag t tcagaaccaatctc ttt tcacgc tg t tgca t tgaggagagga 

aaaaatccettaaatttgaattceatgtggatgetcttatttaatatgccgtttcaaagcctgtatgcgatcaagccaga 

gtattatcaggctctgagttcctgtgtgtattcccttggtccatgctccatctcaagtgtttetccacactgggataggc 

agcatattgtgtagtcttagagcagaaaaggtagaaacagattcttagtacaaacaetggtgagaaaageagcatattgt 

actgtggaaccacccgccgtctgccttctgtgggagatggggatcagcctcttcctggatctcttctttttattcgggtg 

c tgggagg tgaggc tgeaaaggagge tgttg tagagggagece tee taaccatgat t a tg taaaeaatage tgc taggca 

acttgttgtttgttttatgeaaggeagtgtgctaagcacttacatacatgtattctgtctgtagtttgttataagagtce 
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tgcacacttaaccatgattatccctactttgcaaattacaaggttgcagtgcagagagcctaagacaccttctcaacttg 
agcaagtaggagacccagaatttgaacccaaacctccctgaatccaaaacgaccctttgttactttgcacacttcagtca 
taggaatttcagtgtggatttctgtattggttatgtattgccgagtaagaaatgactctggccaggcacggtggctcaca 
catgtaatcccagcacctggggaggccgcagtgggtggattgcttgaggccaggagtttgagaacagcctggccaacatg 
gcaaaaccccgtctgtactaaaaatacaaaaattagccgggcatggtgatgcatacctgaagtcccagctactcaggagg 
c tgaggcatgagaat tgct tgagc tc tgggg tgggcagaggt tgcag tgagccgaga tea tgcc tgggggacagtgagac 
tctgtctcaaacaaattactccaaaatttagtggctaaaaaacaacataaaacaagaaacatttctgttgtgcctcctct 
tttccbccctccctctttctcatagttaatttgtttgaatcaggattcaaataaggtcgatgcatttttaatctataggt 
tttgtctcctttttttttttttccttaaaaaaatttcttttaatgcaacagatttaagggaccagggtgttctctagagc 
aggggtccatgacctctaggccacaaaccaatagcggagccgtggcctgttataaattgggtggcatggcagggagcgag 
tggcgggcaagcataggaagctgcgtctgtgcttagagccactctctgtcactctcattagtgcctcaggtcccccccct 
gtcagaccagcaacagcattagtctcataggaccgtgacccctattgtgagccacacacgcaagggatctaggccgcatg 
ctcctcacgaaaatctaatgcctgatgatccgtcactgcctcccatcaccctcaagatgggattgtctagttctaggaaa 
acaagctcagggctcccaccgattgtacattatggtgagttggataactatttcattatatattacaacgtaataataat 
agaaataaagtgcacaataaatgtcatgtgcttgaatcgtcccaaagccacccccactcctttgagtctatggaaaaatt 
gtcttccagaaaaccacttgctggtgccaasiaaaatttggaaacctctgctctagagatttctgcagtctgtactttggt 
gattgcatctctgtggtgtcttttaacatactcttctgtctcctgagtttcctgtgaaagggtagataactcaagagact 
caa tcaac ttcgtgt ttaa tg 1 1 1 teg teiagaatac ttcagaggcagtggtg taa tcgtccaagag t acacacaggcc tg 
atcgccaaagccagttttcttcaggcccagcacgtcctgacctcctcactcctctgcccactctctgtttactcactccc 
attttcattcattgattctgttattctccactaagttttctgggtggtcagcccaggaagcctcagcctgcttccagata 
gttctcctcagagttagcggtagcagcggtcgtcggccagtgtgtctcgggtcaaattattttgggcattactggaaagt 
tccaaaccaaagcctgtggtaagttccacccaaactggcttgtgagggatatttttgtttaaactgcattcctggagagg 
agccagacttcctactgcctggcagcagtggtgtccataggagtatgaagagctgggaccctttctaatcactcagacca 
aatattggggttttctgaaatggactgagaaataacagtatgtttttatgaggctttgcacattttccttcctagcagta 
gc tgc t tag tctac tgaaaag tgca ta 1 1 1 tgaacagggcc tagaagag ttaacagc tec tagagagagg tgc tc tg taa 
tactttttcttcttcaaaaaatggtttatggctgggcgcaatggttcatgcctataatcccagcgcttttgggaggccaa 
ggtgagaggattgcttaagcccagaaacgtgagaccagcctgggcaacgtagtgagacgctgtctctgtattacaaaatt 
1 1 ttaaaaaacggt ttg taggtcc ttaagt ccc tga taaaa tagagaac tgaa t tgcaa tec tggaac t taaaaagt tgg 
tgaegacaectgagatatttattacttagattgcagttactgggtcagcttgtataatactgaecaagggtttttgattc 

I I ec t ggaa 1 1 ga taggaaa 1 1 ea t a t taaaa taa t taeecaag t cc aaaca 1 1 1 1 1 agaac tgca 1 1 1 1 tga tea tgga 
tttttatgtetctte tgaac 1 1 1 c tgt caeegg t a t aa 1 1 1 aaagaaa 1 1 a t ac t taagc 1 1 tg t c tcac 1 1 agaaga t a 
atatagaacagtggtgtttttttaattaaaaaaaaagt taaaa taacggttttgtatccttgctttacttcttaaacata 
tgggaggaaaaaaaa t c 1 1 taacaag tttatttat 1 1 tea ttttetgctaaattaettt cagaae t tgaa t e t ac t aa tc 
ccagatataatattcttggattcatattecaaattttgetgtetcaaatccatctagggaagtgggtgggetataaatta 
t aaa t aaa t tecaaat t ttg tggga tgaa t tacce t gaagaceaaeg tg t aaa 1 1 aca tat taa tc tttctttttetcee 
tage t egg 1 1 1 1 aagaa taa tg 1 1 1 tagccewica t a t c tgca 1 1 ac t c 1 1 ggc t caa ta tgagaaa tccatttttggttt 
gcc taacagaaga tea tg t tgc 1 1 1 gc 1 1 c tc taeaeag ta tgaaaacc caaagaaaagaaaaacagaggeag 1 1 1 1 1 1 g 
ctctaatgaatgctctaaatctagetettaattatgattttttaaggaaaattttgaaaagtctacaagttaaatttttt 

I I I e t a tecca taca 1 1 1 1 cca t ec taaggea t tgaeiaaagcac ac t g t gaaa t ac t tag tg ta tc t agaaae a t eaggg 
aagaatgcttccctcctaagcaaaattttgeettetgaaaettttteageattcagcctttttatataatacttagaaaa 
atatttetgaaatagatcatacactctcttcccaaaaacateaaagtatgaccgtaaagggcagaggtaggcaaaettec 
tgtaaggggccagagagtgaatat t tgagaat 1 1 ccagactattaagacc tc tgt tgeaac tgct tgc 1 1 1 tgccg tggt 
agcctgaaagcagccgtagatagtatggaaatggatgatcatggcagtgttgeaataaaactttacaaaacagaeaatgg 
gccagattggccaggggecatagtatgctacccetgggcaacaacctgtatgecctggagtagtgtaaagaaegtgggtg 
t tgggggt caac tgacget tccage tc taccac ttae tggc tg tg tggc 1 1 tgggcasuie t ac tgaaaate tcteagcg t 
caetttccaagtgtgtgtaatgtgtattttcacagtgctttgcaggttgttgattattgaaaatagecataa tgca tgaa 
attaccagacacatctcaetttatggagcctggggctattggtaatatgcatttctttctcatcttgatcgtaaaatgat 
c t tagaaagg 1 1 tc tgagaatata tagag 1 1 taagacagcaataagacaae taa t taat taaacaggaaaagggga tg 1 1 
gtgcteagagaggaagtgtgggtctcataagggetttcacaategtttgagaggacacgtgtgatgtetcatgectgtta 
tcccagcactttgggaggccaaggcaggcaggt tgc ttgagttecggagtt tgagaccagcctcgg taat t tggcaaaae 
ettgtctetacaaaaattaeagaaattagttgggtgtggtggtgcacacctgtagtcccagctgcttgggaggctgaega 
gtaaggatcae t tgagecagca tgg tggaggc tgcca tgatca tgc tac tgcac t ceagcc tgggcaacagagceagaac 
c tg tc t tgtaaagaaaaggaaaaagagagagaagggcagaaagaaagaagggaaggaagaaaggaaaa ttgggeeeagga 
atgatetttacaatgcctgacaaccaagagaagaagggaaatgagcctcaeattgcetgcaagctctaaggtgaeaagag 
ceaagagaaat tat tgttactgtag tga tgttccactgaggatcataaag tac tttattactc tac tgagta tggt tat t 
ggatatgtgttcttctttttctttttctttatettttttgctattettttgttattcttgatttatgctgatggaaagcc 
atggacceaaggatgcttcacagttttetttaggagtaaatgcttagattecatgttetttgaeatgagctatgtctgtt 
cc t c tcgag tggaageatcctt t teaga tgagt tgecagaaaageagccagc te t ggataag tgaggtacagCagaacae 
ac tgcaaatac taggaatcc t taag tacag tggaacecccuiageae tc tacc tgc t ttc 1 1 tc tcacctcc t taaaaac t 
ttttttgccctcacetcatcatttatteagcagtcacaaeagtgecaagaacttggctagagattggaaataaagcttat 
gccttctcteatatcteetggaccttatttetttcttacaagaattgtgatgcttaaceagtttttttgataaccttttt 
ataaatgccaaecctteeaaaaaacctgceceeetggtggagagaagaattattacatcaattaggggtcacttagcatg 
aeatttgtcggaaaaaaaaaagttagtgageetttttgccatattaaaagtcateaetgeeaagacataaatgaaaatgt 
gttcgaattaaccacaccaatgttcacaaaataaacatttttgatttcccaacagaateetaggtctaactatcactate 
atctttcatgaaatcaaagccatatatgtaaattgaaeacaactttcccttccatagagagtaaaaaccacgctttggag 
ggtagataeaattaccecagggttgtetttteccactcctcaeaatcecaccagtgeacatgcaaggtgatgtecttcct 
ttagctatagcaaataatgttaattattgttggtgttaaataatgattatgtaaagcaetagactagaeattegteggca 
aagtttttctgcaaagaatcagatagtaaatatttttgctcttatcagccagacagtctctgcggeaaccattcaagcat 
tgttgaatacattgttgagtgttaeatgagactattgtaatataaaagtagcctgggaeaacaeataaataaetgggtgt 
ggctgtgtcetaataaaactttatttacaaagaacaggaagtggettggatttggtatctggcetggcagctgtggttta 
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ctactccctagacggtggcccagagaccctttaaatgaaattcattttaccagcaccctctttcatcatgagaaatatat 

tctgtttttctcagaaaatgggttatgttaggtctggtcaaggtaaataagtgttgagagtcgatgttgtgtgcatagtt 

aatttcaattctttgaagaagctcccccatgatatttcacggctgagagaagaggaaagagtttaagtggaacagtgtgc 

tttgctgagctttggaaatattaccatatagggaagcaggtcaataagacaactaagtgctgtttcaataacgaagatac 

tgaagcgctaattggagtatggaaccatataatgatgataataattgctaatatttatcagctatttatcatgtgtcatg 

tacagctaagcacttacataccatcccattttatccttataatgactctaagagtgggttaaaaaatggcagaagaacgg 

cagtgtttaatggttcagcctggtgcaatgattaaccacattttacagactaagaaattaagaacctcaaccaagttcat 

gc tagc tgg tacgcag t aaggc tagaac t teg tccaaaatc tc t tc t tc tg t tgagc tcagc t tgca tag tagc t tggag 

aatcagaaagatctgtctcatttgggaaatataccatgtaaaaaacattgtttctaaaggagatttgtcccatgagtaaa 

atagatgatggacagccactgcctagtgggacaattagaaaggtcagttcaaggttggaggagatgcttctttcagccaa 

ttttcctttccctcaggatcacctcaggtgatccgcccacctcaacctcccaaactgctgggattataggcgtgagccac 

cgctcctggcctttcagccaattttctatcaccaaagggaaatcgttttgctggaatatgtggtaaaggaggttaaagtc 

aaaagaeiactctcgtcccgctcagttaaggtactcagactattttccaaccaatcaaaaggggtgctgcttcatggagtt 

cgtttaagctaaagcggcagctgttgactgtcatttgcatcatctttaaacatttactgtgaatgtcactgtccatttcc 

actttctttctacttgtcttcaaattgatgcttatcaagtagacagaagaagaccaaggggtcgttttgctatttatacc 

tccaaattgatggcgtgatcactctcaagtgcaaacccagccctgacactgtcctgtttggcaatgtcctgctatctgac 

ctgcaaa tagc tacact tec tgctgtggcccacccaggcctctgggatctgatcccttctcccatgtcacctgtggccat 

gtcttccccaacccaggcttetetgctccgctgcttatgttctggatcctgactctgagcctttgettgtgtggcteetc 

tcccac 1 1 g tcca t ca t ca tg t cag t ta t g t aagc a tagat tccaaag tgacaa tgaggg tag t tagga t ggcaagagga 

gaaaaaccaagctggatteatggattgcttgtgtcacgtcctctacagagcctcccttctacetgctcttacccgggcae 

cacagttgatgagttattttttggaccatcageaacacacecaatcattgtaeatggaatacttcggggatgcttttttc 

tatattgattttacttcattaaactgagctccagaggagcagaaactttggctaatacatcttggtcttagcttgtaata 

tetgtgetaeaacttattaagatggtgacgctggcaaattccttaatctttctaattcttaatttcatcagcaaagatgg 

gaaaggatactaacaccac tc tggg ttg t tgagaggat tcaataac tgaatatttataaagcgtag tacc tgatac ttaa 

taaaaaagtgaattttaacctgtgtcatcattgccatcgtctttatcaccctttgcaactattacacttactgccttcta 

g tacaaggaaggggatggg tggctggc tggc tagg tgga tagaagga tggaagagg taagtacaaggaagggga tgag t g 

gc tggc tggc tagatggatagaaagatggaacagg taagcataaggaagggga tagg tggc tggc tggc taga tgga tgg 

aagggg tggaagggga tgcaagagg taag tacaagga agggga tggg tggc tggc tggc taga tgga t agaagg 1 1 ggaa 

gacaggaaggatggaacggaagaagagggaagaacaaaciatggaaacacatag tac tgt tagg tgaac tgaac t tc taag 

gtgecgattctcagtgatagaa.tct tgag ttga tacc tec ttgggtggcatggagccta tacc tttgtagatcttgggaa 

acaacttctaaaagaatcatagttgtatgtaatcgtaagtacEiacaataattattttaggcacataacccaaaggttttc 

ttacaaggaatctatgaaccttaagagtgagggcttctgttaagagtaagggcttccctggagtggacatggatcatggg 

actgagccagcttggca ttg t tggg ttgaacagggagcgacacctcteagcccagtctatcaagcctgctctttgacctg 

cagtgagaccacccacgcagacatcaatgcagcaaatccccggcgtcagggtttcaacatttggttactctcagagaact 

ctcgatttatataagacttggaaaaagggtttgagtttctgtggtttacaattatatttcccaacttggcccatgaatcc 

agcttggtttttctcctcttagcatcctaaccaccctcattgtcactttggaatctacacttacataattgacatgttga 

gaagc tg tgc teiaaaccacac tgaaa tcacat t t taa taacatgggaacca tc t t t tcccagtaaattg t tgagaagc ta 

attcttgtcagcctaaaaacttgaatatacatttgaataaatcagcggtgctacaccgtggcagcctgctgaaaatcccc 

aaggaaga t ta ta 1 1 1 1 tag t tgagc tac t tg tcac t gcac tg tg 1 1 1 1 taa t a t tg tgagc cc tttcttgtcttcatct 

ttgaagaatctattgcataccttgtcattcagaaaaacataaacgggacctctcaattagcggtaaagttcgtcagttta 

acttttaagcttaaactcctgttatagttcggtcacttactcgtcatcaaaaagatatttgagctgatattatgcaataa 

t t c ataacccuiaaacaggaggaaaaggt c t c tg c t t g tc tcagaag tacaag t ta tgca taagg tgacaaa t tacacagc 

tttgggaaatgggccttaatggaatgcaccaggctaatagaaaagcagtgcctcgatttcccacctcagatctgaaaact 

cc tgagagac tgacagggcggt tccccagc t tggc tgtgtgaggaa tcagc tgggaatcgaacactgcagtc tggc tgt c 

acccccaggat tc taa tgtaat tgt tctgcattg tgga tcgatggatggatggatgaatggctggacagaaggaagactt 

ggaaaaaatetttgagtttctgtggtttacaattttattteccaacttggcccatgaatccagcttggtttttctcctct 

tacca tec t aaggagggagggagggaggaaaagggggaaggaagcagg t agaaagggaacagggaagggaggac tggagg 

aaggaaagcagggatcattctggagtgtacactgggcatttgattttaaagcaaatccaagagatcattaaatttgtatc 

aatgaataeaetgttacccctcaatcaacactacaatteatatatcagaatagcactttgttacttgtttctgagatagg 

gc tggc tctgtcacccagactggtgtgcagtggcacgatcacagc tcac tatggcctcagccaccagggttcaagtgate 

cttccactcagcctcctgagtagttgggactacatgcctgcaccatcatgcctggctaatttttatttttgtagagacga 

ggtcteettgtgttgtccaggctggtctcaaacttctggaatcaagtgatcctectgeccctgcctcccaaaatactggg 

attacaggcatgagccatgtccagctgccttgtttttttctttgaagagaatgatagaccttccataggaaaaatgttaa 

atatgtgtgtgcagaaataaagatttcaaatatttttgcaagatattttttctaataccacttttttccacattttccat 

aat t tagtgaaga tagtaaat taacciaagtggaaaagac tgaacattttaagaaaagccaag 1 1 taaaaatt ttgaacc t 

aatatttctcaaagtagctaaaattcagatattgagaataaattcaacttgacatggcaaaattctaataggctgaaata 

atgttttggtc tagaac tatgtagctttgtgtagcccatcaattgtctaaaaaaagagtaacctattttgatgaaac teg 

ctg ta tc t tgtaacctgta tee tgtct tgg tattgtgggagtatatatgatttaggggaaagagtctggagagacc ttaa 

gt c tgc 1 1 tagggaaagggtgaggaaccccae tgaagge tact taacgeat tt tgagaa t g tcag taaagatt tc tcaga 

geecagggattttttttttaattgagacataatttacatacaatacaattecttttttcttttttttgagacggagtctt 

gctctgtcgcccaggctggagtgtgatggtgtgatcttggctcactgcaaectetgeetcctgggttcaagtgattctcc 

tgcctcagcctactgagtagctgggattacaggtgtgcgccaccacgcttggetaatttttgtatttttagtagacacgg 

ggtttcaccatgttggttaagctggtctcaaactcctgacctctagatccgcctgcetctgtaateceaaagtgctggga 

ttacaggcgtgagcccctatgcccagccaaaattcattattttaagctgtacaattgagtgatttttatcacgttcacaa 

agttacgttaccattaccacttttgaattgcagaacaatttcatcactccaaaaagaaatcccataacaattagcagtta 

ttttccatcgctacttcctccagtcccaaggcaatcagcagtctgctttctatctctacctgtttgcctattctggacat 

ttcatatgagtggagttagataatacatggccttttgtgactggctttcacttagcataatgttctagaggttcatttat 

gttattgcaaaaatcagtacttcatttctttttatggctgaaaaaaattceattatgtagatgtgccatatttgtttatc 

tggttgtcagtcggatatttctgttgttcctaetttttggctactgtaaaagatgctgctatgaaeattattgtgattat 

tataeettatttgtaaacatcatgggtggggggttgcagtciaacatgttggaaagtagggttggaggtccgtagaaattg 
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ggggcttcagcacttcccccaagctcaacaccaaccccctttctgagcccctcttgaaggagagttccctgggacgtgcc 

tggtattggtacaatcagtcaggaagcatttttcctggggagaaacttacaagtccacgatcaaagccaacaagagacaa 

ggtgttacatgactcattttcggtttaagaagtgacaggctgattctaagttgggttcaattattttgttaaagcgcttt 

gc t ta t c tgacttc tec tgacc tcggaaatcia t tc taaccaa tcag tgc tggc tcccat tggccc tgggg tc tggt tgcc 

ttacagccggcgacagggggaccactccactaccacatgtgaattaatcctcaaccccagagccaagcgccattctccag 

caaggttgtacttcctcattagctattcccagggcccagaaagtcccagaggatgtcagagtacattaatttttatcata 

acatggaatctttcaggtctgaatggcagcacacggctgtcaggggcttctgaactctattacagctccatatatctcta 

ggcaaaacagaggaaagag teg tea t tggcaagggagatgtacaaaa tgca tgagatgt 1 1 1 a 1 1 1 1 1 tgag tgac t tga 

ceacgtgcttaagcacattccccaaacaatttttttcttattgtttgtaagttgtaagttgtaaatteacctctgceaec 

acctattaaagcceacteectgcattaaaactgtataaagtgtatttaaataaactctctttgcatgatgtgaatgaaat 

cgtcatctggtacttaaaaetattetataaagttattaaaaaattaatgttcccttcccatgatttttctgcagciattta 

tgca tccatgatactgeagaagtteacaaataatggettgtat tgc tgc tt tag tat tgc tttatgcctacgaaatataa 

tgttaatttgtagcaatgctaatgtgttttcaggeiaggctctttgtttattgcctttattttccccacttaecaagtggg 

taaaatgc 1 1 tgagggt tgcat 1 1 tatg ta t tcaggaggcccaggtat ta 1 1 1 taa tagaageac tat tgacaaatacca 

gtcatcccccctgtgccaggccctggatgaggcactgcttcgcatgggggctccccagattgtcccacaaggaaagcata 

gtcaaagacaaagttttcagttgtaagagtaaacgtgttctgcctaggcattgccaagtaatttactgccagctccagcc 

cttcactcaagtttcctggatacttttgacttcttagccatggatgtgtttgaaggccgcatggaccttcact tact tgc 

ac tgcagg tcagcctaa ttgca tgagc tc tgtggaccacagagcaggg t tt t ccaaagt tcaccaagacaaa ta c t g tat 

ta tc t tciacatata ttcat Ct 1 1 taaaac tgaaaatcagaagagcaac tccacc tagcagaagtc 1 1 1 tgcaaagggcga 

ggcgaggc t aaaaagta tagaagag 1 1 eg 1 1 tccagtgcaa 1 1 1 ta taaacacaga tgg tec t taaa 1 1 aagcaaa tgg t 

acc taaatgactgtgt tg tgga taa tgg taacagagggagggac tcggggg 1 1 1 1 1 taaaaagtae tgat tg ta tgcagt 

gttttaaacagataaKrtgtgatettagtgtgatgaaagatgctgggagatttcaccagtggtatcttattatttttcggg 

gattttgtaattcaacaaaattctgttgtatgccaagcataaccetaggtgtgagagcacaaggtgactteagataccac 

ctctatccttcaggggtttggggcccattattatgacttaatccattttgggcgtgagaagctgagggtcacagaaagaa 

ccaattccctctttaaataatgccaccccaaccctcetcatctgccaggtctttcecttcttetatttgtatgaataata 

gtcactttctcttgtggagttcgctaaattctactttggcctatCcLaatttctttcatatcacaactaaatttcttaagg 

acgggactatggttcatttgteagaegaaeaaatgggaatttgceaagagacacttgggttaatttacgtcttttccatc 

caagggcactatgttgeiagtgaggctagtaggteatgagtgtggttgaagttactttttcttactttcccgaccagcccc 

ca tec ttac tgcacttaaagt tgat tgtceatt t tat taaa tgtecccaggaagccagaacacagggcag taaagtge tg 

aatgcaaagggcaggagaaaaatggaaacaaccagaactgtaacaccaaggaatgagacctgcatgtcagata tea tgcc 

cattgcactaagtgccattggggcaeaattatcaaatggatgcattttccctagaaaaccatcttggagagcatgtggac 

gtacttctattttacatttccccctatttacaatcaatgagattgagattttgttgctgggactgctgatgatgggatgg 

gaaaatataatcaaggtaatggacatgaggcaaaaacctaaggaaatgacaaaaacaagagtatttccattttcagttaa 

gcgtatgtac tga tgttctggaattcactataagaagttgcaaa tgg tgca tgaaatgaaaaat tec tgg tgg teteeag 

gggacacagccggtgctgtgctccactctgggtaactgttttggattattttetetattccaactgaaataaaaaaaaat 

taattaaatgtggctaggttatcttgacagcagaatccattcecagttaattattattttaatacttgatggtgtctgtc 

aaattgtcgacatgtgaeggteettteaaatttaaaggaatagctgatggteactggecacceaagetgatactgatttt 

ata tg t tga tgtttc teat tttatt tgc tett tec ttgaa tat t tat teaggacattctetaecagacatattgagtaag 

ggcaacagaaacaatacataagtatcttataaatgtggaaaacaatgtatatgtgttttttatcteteaatgattggtgg 

gtaccata tccccaaagtagaa tgageat t tgagaaaacaggaaata tec tc 1 1 1 taggcaccate tc tg t caagge t ga 

tgctgggctttttatatattttetetaattettgtggctgteaaacaaggtgggcattatcattccctttataggggaca 

cage tg tggc tcagaggggttta t tcac 1 1 tec tgagggecacacacat aa tgagaggcagaeaeagg tgacgaag tgag 

ttttccctgteacgccatcttatctgtcacatacctctctgacatgcteiaaattgcactaaaeaaaagaattctettatg 

cacatatcatgcaaaagatattetttaactggggatcatgtttctcattccatcaatagaatgactaacattttctgagg 

gtgtctcacgtgaaagtaaatcgcteatgtttgttctttttaaaagatgcccttcgtattgtgtatcttgcagtcttgct 

ttctcaaacttaagccaactatatcgtcattttCgcaaaatcactgcgtcagtttactattatttaatgtttattgctac 

caattttaagaaatcctttataggactatttgtgaaattgattttgtgaggatgatgatataatttccattacattacag 

catataaatataaatatatatatatatatatatatatatatatatatatatatatatatatatattttattatttttttt 

tgagacggagtct tgc tctgtcaccaggc tggag tgcagtggtgcaate tcggc tcac tgtaacc tccgccgcccggg 1 1 

caagegattcecctgccttagcetcetgagtagetgggactaeaggcatgtgetaccaeacccagctaattttttgtatt 

t tag tagaga tgg tt tcac catgttggcgagga tgg tctcagtctcctgacetcgtgatccgcetgecttggeettceaa 

agtgctgggatatacattttttttttttttttgagagatggagtgtcactctgttgccgaggctggagtgtagtggcgea 

atetcggctcactgcaaectcctcctccggggttcaagegattctectgcctcagcetccccagtagctgggattacagt 

cgtgtgccaceacgcctggttagtttttgtatttttagtagagatgggttteactgtgttggetaaggtggtetcaaaet 

catgaectcaagtgatccgccegetteagcctcecaaagtgctgggattacaggcgtcageeactgtgcecggecggata 

gaaa taa tt 1 1 tataaactecttgga tgc tace taaaa tea tc ttg 1 1 1 tgetagtggcaca tgctgea 1 1 1 tgggcage 

tgtggcettggtggattgetgaagtagatttgaccttacctggactgaggcagctgttgaagggaattgctgtgttcagt 

gtataetgccatecatgatttcacgaaaccagctccagctatttaagcaggggtcaaacttagaattctacattattttt 

t tccc 1 1 ttc tgggaggaaagacag 1 1 gaacaccagcaaagac taagaaa 1 1 tc t tagaagae tgcgggccc t tgggccc 

1 1 tc tat tgaa 1 1 ccagag tat ttccaaa tac tatgaag tc ttgcagc t tag t tgagaaa tgccccagatgg tg tgaca t 

tc tgc ttccaggagggattggaaag tat t tec t tttacataacat tccac teagc teat tec 1 1 tgc tgtg tctgaaat t 

gaatceeecaaagecacaattatcttaaeattcagaagagtgtttatttaatetgcaaaatettgccteacttttgggga 

gcatgttaaeaatttcacttacaaatcttctgtgtaactcaaceecatggtggtgtctaetgctgctcetagactcttta 

aagcaccttteteatetcaggtttgaaatgatatgtctcattettgggttccttgagtegtaatgggtttgtettgtctc 

eacagcataaatgaetctttettgatcaactagaaccacateaaettcttceetccagctteagtgatatattgtgaaac 

a tggc tat tcaacg tec tgtagaeeaaatgceataagaaaaatagcattgattcaaacgtateeatecaga tace taaaa 

aagttttacttcttaecacatcttgagtetgggcaaacacgcaetteetatggaeattgattactgtetactgtagagat 

aaca 1 1 tgcaca t acagat tatggcaca tgg tagaaag tg t taagtaatg taggaa tggacat atcccaagcaaaa t tgg 

aagccaagtcccctgtccc tgc tcaagt tgg tatgaccggtgtatgg tgcc t taatgggtact taaagtccaggtgagag 

tggcaggaggcagccaaatgcctaggtagataggagccgg tccctgt tgaaaccccac 1 1 ccaagt tgaagacagt t taia 
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agac tgaaagccaagc tacaag t taaatcc tcggaccaga t tgagaac t tg tc 1 1 c t tac C tggtgcactc 1 1 c tga t tg 
atccccacctttcacctattttacatattcctgcccttccctaactggtttcctatgctgtcatgcccacctttgagtgt 
tgccttcactttaaccttctgtgcatgctcacaaagtaattagcatgtaccctccattctgagCtaatataaggccccag 
acccagccacatggggcaactttactgccttcaggtaggggaaccacccccaccacattccctctccactgagagtCCCc 
cttttagttaataaattcggctccactcactctccattgtctgcatgcctaatccctcctggtcgtgagacaagcagttg 
gacccagc tgagctaaggagcagaaagaccgtatcacagggaactcg tgtaacagct tgatc tcctgtcc tacgtagc ta 
tctattggtaagaagttgaaggaacttgtgtcattccgttgtgcctgtcgtcttgaccttgtaaaaggtcttgggtaagc 
atgcaagaag 1 1 1 tgaagagggaga t acagc taat t tgcaga taaagagcaagggaagaattcc tggagaaaggaagag t 
t tcctgag tcacc 1 1 tgggagg taggaagggt t tgacatg t aagctgggca tc tgggaacgag tgaggga t tc tg tgagc 
cccatctcagtggaccactcaaggaaggtgggtaagccctgggtaataagtgtgtaagcagggaacagaaagtactgtga 
tttaaaatatgttaatttttctactgtacagatgagagccagcttggagatgggctgtagctcaagcatcttacctacct 
ctgatttcttaatgccacgttataaggctgctgcttatagctcttgaagtcactccaaaaacagatgagtgagaccctgt 
tgctaaagtcccactgggtgtagattattcacagatgtatacacagtggctcactccaggtaggatgtgatcagtgcttt 
tagaaatacagaaagtccta t tgg 1 1 taaaaaaaat 1 1 1 1 1 1 1 tgtaatgaa t tgagt 1 1 taaagc tagcac tg tacaat 
aaaagggtgaa 1 1 tcac ta tgaat t atgacaaacacag tcatagagctgcca tea tcac tgtcaagatacagaacagtgc 
catcaccccccaaatgtcctctgtgccttactgtagtcatacctgctcctgacacctagcccctgggaaccattgatctt 
ttttctgtccctagttgtttgtcttccctgcaatgttacataaacagaaatattctgggtgtcgctctcggagtttggat 
tccttcccatagcataatgcatgcagtatttgtctgtttctgctaaattacccccaagacctagcgacttaaaacagcaa 
acatacactatctcactctgttagctttccattgctgttgtaataaattaccacaaacttatgtgcttaaaacccaaagc 
tactatctgacagctctgtaggttaaagatttgtcatggttctcattgggctaaaattaaggtgtcggtatggctctgct 
gctttctagagactccagggagaatatttttccttgccttttccagcttctaaaggtcacccacaacatcttcaaaacca 
gcactgttgcatctctgaccttagccctgtagtcccatttccctctagccacaatcgggaaaggatctcaggactgttgt 
gatgacactgtgcttacctagattatctagcatgagctccctgtctcaaggtgatagagtttgcatgtttttctcctgca 
catgccatgttgaaacataatccctagtgttggcggtgggtgtgctgggaggtatttggatcatgggggtggaaccctca 
tgcatgacttacggccatccctttggtgataagtgagttcacatgatatctggcaccttccttcctctgttgctcttgcc 
ctcaccatgttagctgcctactccccttttgccttccgccatgactgtaagcctcctgaggcctcaccggaagccaagca 
catgcttcttgtagagtctgcagaatcggagccaattaaatctcttttcattataaattatccagcctcagttacttctt 
ta t tgcagtgcaagaa tagc t taacacacaagg tc t taccc t tga tcacgg tctgcagca tg tc t tt taccatg t aagg t 
eiatatgttcagtggctgtgggggttaggatgtggacttctttgggggacttttatttttcccagttactatttttgtgac 
tcaggaatttagggacag 1 1 tggc tggt tgtt tc tggc tcaggg tc 1 1 tcttgggc tgcaa tcaaga tg tcagc tggggg 
c tgggcatggtggctcac tec tgt tatcacagcact t tgggagg tcgaggtgagtggacca t ttgaggttaggagt t tga 
gaccaacctggccaacatggtgaaacgccatactaaaagtacaaaaatcagtcgggcatggtggcacatgcatgtaatcc 
cag 1 1 agt tggggggc taaggc tggagaacagcttcaaaacaggaggcggaggt tg tgg tcaac tgaga tcacaccac tg 
cac tccagcc tgggtgaaagagcaaggccccacctcaaaaaaaaaaaaaaaaaaaaaaagt tagc tggggccaagg tea t 
ctcaagtcttgactatgaaaggacctaacatcatgcatggctgttggcagaggtcacttcctcatgagcattaaactgaa 
age ta taa ttcc tgac tattgaccagaggcgaacc tcag 1 1 ccc tgccatgtgggcc tc 1 1 ca tggggcag t tga taaca 
cageag t tagc ttcea t tgga t tgag taagcaagagagcaagaacaggagtgatacagaagccagca tc 1 1 1 1 tg taaac 
caatcteagaagaattgtccatgattttttctatattctgttagttagaaaeaaattcctaggtctcacccaeccttgag 
gtgaggggat tacacaaaag ta tgaaaaccaggaagcagggagcat tgggag tea 1 1 tggacce tgcc t aggacag tgeg 
tttgagatccatgttattctatccattagtagtagtttgettctttttaatgttgagtaatatgccattgtgtgtttata 
tattgetaatggattttaaagagggctaateaacgtgttgattagagggaaattttcttcagtgaaataatatttgagca 
aaacc t tgaagaacaa t taggaa 1 1 tgacagagggeia tggcacgaa tciaagaeccagac t taa tcaag tgaggggcg tac 
tea tccatggggrcag tgga tgat tcacc tgcctggagc ttaaggagaagggggtgtagt tgt ggc tgt tgc tcagaagga 
ee tgaatg teaggatgea 1 1 tg tgt t taa 1 1 tagcagacat t taggaaccattaaaagt 1 1 1 tgagaa tgggtagg taag 
attagaatagtattttaataagtttaactaggctttaaccagtattcataaaeiagtcaagtgggagaatgtagaggtgac 
aagaceaggtgtgggct attgcagtgac tacacc taaaaggaacagagc tagaaaca tggag tgaca tgaagtcat tg t c 
t tagg t tgg tgatce tagcage tgagce tggga tggggat tc t tgt tec ta tga tcgat taggggae tg t tcecagagga 
gageggagagggaagcaggc 1 1 tgg 1 1 te tac tggaaa tc tg 1 1 tccgeccgat tccatggggagc tc tggagaaggaat 
tgtatcaccgagttggtcctccttgtcaacagagcatgtgtgtggggg tga tgac ttcceagatgtgagggtgcccteea 
geaatggacagttatctggaggagttcatgetecagaggtagtgcctacctctgtggagaagtaggggatggcacggtat 
tt tagggg ttcc taa tgagce tagaga tac tgacat tgaceag tea tggagactaaaggagaaga tggga tagcacat tg 
tt t tee taag tat ttgtaatcaggtgac tgg 1 1 tgcaaa tatcageeat tacagaaataaagggtatcaggacggggagg 
aga tg tgaggagagaagatgaagag 1 1 tagat t taggca tg t tgacce tgaat t tgtgggaaggta tecaggaaga taag 
ae tgacaggtgggaa tgaga t tctggaact tggaaagaa ta tag tggc tagagc taagggt t tgaagtct tea tggcga t 
1 1 tg t tcggag tcatggat t tggaagaaac tgc tgeaggg 
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Sequence Listing 

<110> Women's & Children's Hospital 

<120> For Proteins, Nucleic Acids And Methods Based Thereon 

<160> 53 

<210> 1 

<211> 27 

<212> PRT 

<213> Homo sapien 

<223> Cancer associated protein 

<400> 1 

Thr Gly Ala Asn Ser Gly lie Gly Phe Glu Thr Ala Lys Ser Phe 
15 10 15 

Ala Leu His Gly Ala His Val He Leu Ala Cys Arg 
20 25 

<210> 2 

<211> 44 

<212> PRT 

<213> Homo sapien 

<223> Cancer associated protein 

<400> 2 

Leu His Val Leu Val Cys Asn Ala Ala Thr Phe Ala Leu Pro Trp 
15 10 15 

Ser Leu Thr Lys Asp Gly Leu Glu Thr Thr Phe Gin Val Asn His 

20 25 30 

Leu Gly His Phe Tyr Leu Val Gin Leu Leu Gin Asp Val Leu 

35 40 

<210> 3 

<211> 32 

<212> PRT 

<213> Homo sapien 

<223> Cancer associated protein 

<400> 3 

Tyr Asn Arg Ser Lys Leu Cys Asn He Leu Phe Ser Asn Glu Leu 
1 5 10 15 

His Arg Arg Leu Ser Pro Arg Gly Val Thr Ser Asn Ala Val His 
20 25 30 

Pro Gly 

<210> 4 

<211> 34 

<212> PRT 

<213> Homo sapien 

<223> Cancer associated protein 

<400> 4 

Asp Glu Leu Pro Pro Gly Trp Glu Glu Arg Thr Ihr Lys Asp Gly 
15 10 15 

Trp Val Tyr Ty^r Ala Asn His Thr Glu Glu Lys Thr Gin Trp Glu 
20 25 30 

His Pro Lys Thr 

<210> 5 

<211> 34 

<212> PRT 

<213> Homo sapien 

<223> Cancer associated protein 

<400> 5 

Gly Asp Leu Pro Tyr Gly Trp Glu Gin Glu Thr Asp Glu Asn Gly 
15 10 15 

Gin Val Phe Phe Val Asp His He Asn Lys Arg Thr Thr Tyr Leu 
20 25 30 

Asp Pro Arg Leu 
<210> 6 
<211> 21 
<212> DMA 

<213> Artificial Sequence 
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<220> 

<223> PCR Primer for 504CA (forward) 
<400> 6 

aacacagctc ttatcacatc c 
<210> 7 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer for 504CA (reverse) 
<400> 7 

tggctgtaing tcagaactg 
<210> 8 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer for GenBank AA368108 (forward) 
<400> 8 

taatcctcag cctctagaat gcct 
<210> 9 
<211> 24 
<212> VtiPi 

<213> Artificial Sequence 
<220> 

<223> PCR Primer for GenBank AA368108 (reverse) 
<400> 9 

gtatgatgat tttcagggag aaac 
<210> 10 
<211> 24 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer for GenBank AA398024 (forward) 
<400> 10 

tgtcctcaac tgattcttac aaac 
<210> 11 
<211> 21 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer for GenBank AA398024 (reverse) 
<400> 11 

tcaatgggtt aggcacagac c 
<210> 12 
<211> 24 
<212> CNA 

<213> Artificial Sequence 
<220> 

<223> RT-PCR Primer HHCMA-F, PORIII specific 
<400> 12 

atcttggcct gcaggaacat ggca 
<210> 13 
<211> 24 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> RT-PCR Primer wb85-F, FORIII specific 
<400> 13 

ttattctgca cttttctggc ggag 
<210> 14 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> RT-PCR Primer P0R-ex3 
<400> 14 

gaacaagaaa ctgatgagaa cgga 
<210> 15 
<211> 24 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> RT-PCR Printer wb85-El2 
<400> 15 

ttactacgcc aatcacaccg agga 
<210> 16 
<211> 24 
<212> mPi 

<213> Artificial Sequence 
<220> 

<223> RT-PCR Primer wb85-A 
<400> 16 

tgaattagct ccagtgacca caac 
<210> 17 
<211> 27 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> 5 'RACE Primer coxido-R 
<400> 17 

ttatttcagc actcagctca aagtcac 
<210> 18 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5 'RACE Primer HHCMA-B 
<400> 18 

agcaaagaga cctatgccta gccca 
<210> 19 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5 'RACE Primer coxido-32 
<400> 19 

atatctgtaa atcgatggga ctctg 
<210> 20 
<2U> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5 'RACE Primer coxido-21 
<400> 20 

acatgaagag gcacattctt ggcct 
<210> 21 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5 'RACE Primer vd>85-E 
<400> 21 

tcctcggtgt gattggcgta gtaa 
<210> 22 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



3 



24 



24 



24 



27 



25 



25 



25 



24 
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<220> 

<223> Direct Sequencing Primer tj96-C 
<400> 22 

ggaggcagct cgtcctcact g 21 
<210> 23 
<211> 53 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> 3 'RACE Primer RACE-AP/VAR 
<400> 23 

ggccacgcgt cgactagtac gtacagtttt tttttttttt tttttttttt ttt 53 
<210> 24 
<211> 23 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> PGR Primer overlapping FORI exon 8/exon 9 boundary 
<400> 24 

accaagtcca tggtttcaga ctg 23 
<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 3 'RACE Primer RACE-NESTED 
<400> 25 

cgtcgactag tacgtacagt 20 
<210> 26 
<211> 27 . 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> Exon 9 Specific Primer #9327 
<400> 26 

actgcctggt agaaggaggt cacttct 27 
<210> 27 
<211> 20 
<212> ENA 

<213> Artificial Sequence 
<220> 

<223> 3 'RACE Primer AUAP 
<400> 27 

ggccacgcgt cgactagtac 20 

<210> 28 

<211> 1225 

<212> RNA 

<213> Homo sapien 

<220> mRNA 

<223> F^OR I mRNA 

<400> 28 

ggtctcgttt ggagcgggag tgagttcctg agcgagtgga cccggcagcg ggcgataggg 60 
gggccaggtg cctccacagt yagccatggc agcgctgcgc tacgcggggc tggacgacac 120 
ggacagtgag gacgagctgc ctccgggctg ggaggagaga accaccaagg acggctgggt 180 
ttactacgcc aatcacaccg aggagaagac tcagtgggaa catccaaaaa ctggaaaaag 240 
aaaacgagtg gcaggagatt tgccatacgg atgggaacaa gaaactgatg agaacggaca 300 
agtgtttttt gttgaccata taaataaaag aaccacctac ttggacccaa gactggcgtt 360 
tactgtggat gataatccga ccaagccaac cacccggcaa agatacgacg gcagcaccac 420 
tgccatggaa attctccagg gccgggattt cactggcaaa gtggttgtgg tcactggagc 480 
taattcagga atagggttcg aaaccgccaa gtcttttgcc ctccatggtg cacatgtgat 540 
cttggcctgc aggaacatgg caagggcgag tgaagcagtg tcacgcattt tagaagaatg 600 
gcataaagcc aaggtagaaa caatgaccct ggacctcgct ctcctccgta gcgtgcagca 660 
ttttgctgaa gcattcaagg ccaagaatgt gcctcttcat gtgcttgtgt gcaacgcagc 720 
aacttttgct ctaccctgga gtctcaccaa agatggcctg gagaccacct ttcaagtgaa 780 
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tcatctgggg cacttctacc ttgtccagct cctccaggat gttttgtgcc gctcagctcc 840 

tgcccgtgtc attgtggtct cctcagagtc ccatcgattt acagatatta acgactcctt 900 

gggaaaactg gacttcagtc gcctctctcc aacaaaaaac gactattggg cgatgctggc 960 

ttataacagg tccaagctct gcaacatcct cCtccccaac gagctgcacc gtcgcctctc 1020 

cccacgcggg gtcacgtcga acgcagtgca tcctggaaat atgatgtact ccaacattca 1080 

tcgcagctgg tgggtgtaca cactgctgtt taccttggcg aggcctttca ccaagtccat 1140 

ggtttcagac tgcctggtag aaggaggtca cttctgattg tcagtgactt tgagctgagt 1200 

gctgaaataa aatgataaac aagtc 1225 

<210> 29 

<211> 2219 

<212> RNA 

<213> Homo sapien 

<220> mRNA 

<223> FOR II mRNA 

<400> 29 

tcgggccccg acgcgcgcgg gtctcgtttg gagcgggagt gagttcctga gcgagtggac 60 

ccggcagcgg gcgatagggg ggccaggtgc ctccacagty agccatggca gcgctgcgct 120 

acgcggggct ggacgacacg gacagtgagg acgagctgcc tccgggctgg gaggagagaa 180 

ccaccaagga cggctgggtt tactacgcca atcacaccga ggagaagact cagtgggaac 240 

atccaaaaac tggaaaaaga aeiacgagtgg caggagattt gccatacgga tgggaacaag 300 

aaactgatga gaacggacaa gtgtttttcg ttgaccatat aaataaaaga accacctact 360 

tggacccaag actggcgttt actgtggatg ataatccgac caagccaacc acccggcaaa 420 

gatacgacgg cagcaccact gccatggaaa ttctccaggg ccgggatttc actggcaaag 480 

tggttgtggt cactggagct aattcaggaa tagggttcga aaccgccaag tcttttgccc 540 

tccatggtgc acatgtgatc ttggcctgca ggaacatggc aagggcgagt gaagcagtgt 600 

cacgcatttt agaagaatgg cataaagcca aggtagaaac aatgaccctg gacctcgctc 660 

tgctccgtag cgtgcagcat tttgctgaag cattcaaggc caagaatgtg cctcttcatg 720 

tgcttgtgtg caacgcagca acttttgctc taccctggag tctcaccaaa gatggcctgg 780 

agaccacctt tcaagtgaat catctggggc acttctacct tgtccagctc ctccaggatg 840 

ttttgtgccg ctcagctcct gcccgtgtca ttgtggtctc ctcagagtcc catcgattta 900 

cagatattaa cgactccttg ggaaaactgg acttcagtcg cctctctgca acaaaaaacg 960 

actattgggc gatgctggct tataacaggt ccaagctctg caacatcctc ttctccaacg 1020 

agctgcaccg tcgcctctcc ccacgcgggg tcacgtcgaa cgcagtgcat cctggaaata 1080 

tgatgtactc caacattcat cgcagctggt gggtgtacac actgctgttc accttggcga 1140 

ggcctttcac caagtccatg caacagggag ctgccaccac cgtgtactgt gctgctgtcc 1200 

cagaactgga gggtctggga gggatgtact tcaacaactg ctgccgctgc atgccctcac 1260 

cagaagctca gagcgaagag acggcccgga ccctgtgggc gctcagcgag aggctgatcc 1320 

aagaacggct tggcagccag tccggctaag tggagctcag agcggatggg cacacacacc 1380 

cgccctgtgt gtgtcccctc acgcaagtgc cagggctggg ccccttccaa atgtccctcc 1440 

aacacagatc cgcaagagta aaggaaataa gagcagtcac aacagagtga aaaatcttaa 1500 

gtaccaatgg gaagcaggga attcctgggg taaagtatca cttttctggg gctgggctag 1560 

gcataggtct ctttgctttc tggtggtggc ctgtctgaaa gtaaaaacct gcttggtgtg 1620 

taggttccgt atctccctgg agaagcacca gcaattctct ttcttttact gttatagaat 1680 

agcctgaggt cccctcgtcc catccagcta ccaccacggc caccactgca gccgggggct 1740 

ggccttctcc tacttaggga agaaaaagca agtgttcact gctccttgct gcattgatcc 1800 

aggagataat tgtttcattc atcctgacca agactgagcc agcttagcaa ctgctgggga 1860 

gacaaatctc agaaccttgt cccagccagt gaggatgaca gtgacaccca gagggagtag 1920 

aatacgcaga actaccaggt ggcaaagtac ttgtcataga ctcctttgct aatgctatgc 1980 

aaaaaattcc ttagagatta taacaaattt ttcaaatcat tccttagata ccttgaaagg 2040 

caggaaggga agcgtatata cttaagaata cacaggatat tttggggggc agagaataaa 2100 

acgttagtta atccctttgt ctgtcaatca cagtctcagt tctcttgctt tcacattgta 2160 

cttaaacctc ctgctgtgcc tcgcatccta cgcttaataa aagaacatgc ttgaatatc 2219 
<210> 30 
<211> 711 
<212> RNA 
<213> Homo sapien 
<220> mRNA 
<223> FOR III tnRNA 
<400> 30 

tgccccgacg cgcgcgggtc tcgtttggag cgggagtgag ttcctgagcg agtggacccg 60 

gcagcgggcg ataggggggc caggtgcctc cacagtyagc catggcagcg ctgcgctacg 120 

cggggctgga cgacacggac agtgaggacg agctgcctcc gggctgggag gagagaacca 180 

ccaaggacgg ctgggtttac tacgccaaCc acaccgagga gaagactcag tgggaacatc 240 

caaaaactgg aaaaagaaaa cgagtggcag gagatttgcc atacggatgg gaacaagaaa 300 
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ctgatgagaa cggacaagtg ttttttgttg accata^aaa taaaagaacc acctacttgg 360 

acccaagact ggcgtttact gtggatgata atccgaccaa gccaaccacc cggcaaagat 420 

acgacggcag caccactgcc atggaaattc tccagggccg ggatttcact ggcaaagtgg 480 

ttgtggtcac tggagctaat tcaggaatag ggttcgaaac cgccaagtct tttgccctcc 540 

atggtgcaca tgtgatcttg gcctgcagga acatggcaag ggcgagtgaa gcagtgtcac 600 

gcattttaga agaatggaaa acaaaatacc accctccgcc agaaaagtgc agaataaaaa 660 

ttttccacta gcaaaagaag gaaaaaataa aagatcttga atagtctcat c 711 

<210> 31 

<211> 524 

<212> RNA 

<213> Homo sapien 

<22Q> mRNA 

<223> FOR IV irRNA 

<4O0> 31- 

tcgggccccg acgcgcgcgg gtctcgtttg gagcgggagt gagttcctga gcgagtggac 60 
ccggcagcgg gcgatagggg ggccaggtgc ctccacagtc agccatggca gcgctgcgct 120 
acgcggggct ggacgacacg gacagtgagg acgagctgcc tccgggctgg gaggagagaa 180 
ccaccaagga cggctgggtt tactacgcca agtaaggggg ccgcagtggg gccgcggacg 240 
cacctgggac cctgcacagc ccacggacgc cacctgcgcg gggaggacgc gcactccagc 300 
gcagcgcgtg cggtgcaaag tgaaagtaac tgttaaggag cttcagggaa aagggtccag 360 
ggttcccagt aggggccggc ccccttggtg ggcctcgggt ccagcggggg tcacctggtg 420 
gcttcccggc gcgccctctg ctgttcagga tgcagcactg cgcggcgcgg cgagggcaaa 480 
gcggcctcat ccccgccaaa aaatcuiagat gttttaaaaa gcgc 524 
<210> 32 
<211> 363 
<212> PRT 
<213> Hcmo sapien 
<220> 



<223> FOR I 


mRNA open reading frame 








<400> 32 




















Met 


Ala 


Ala 


Leu 


Arg 


TV^ 


Ala 


Gly 


Leu 


Asp 


Asp Thr 


Asp Ser Glu 


1 








5 










10 




15 


Asp 


Glu 


Leu 


Pro 


Pro 
20 


Gly 


Trp 


Glu 


Glu 


Arg 
25 


Thr Thr 


Lys Asp Gly 
30 


Trp 


val 


Tyr 


TVr 


Ala 
35 


Asn 


His 


Thr 


Glu 


Glu 
40 


Lys Thr 


Gin Trp Glu 
45 


His 


Pro 


Lys 


Thr 


Gly 
50 


Lys 


Arg 


Lys 


Arg 


Val 
55 


Ala Gly 


Asp Leu Pro 
60 


Tyr 


Gly 


Trp 


Glu 


Gin 
65 


Glu 


Thr 


Asp 


Glu 


Asn 
70 


Gly Gin 


Val Phe Phe 
75 


val 


Asp 


His 


He 


Asn 
80 


Lys 


Arg 


Thr 


Thr 


Tyr 
85 


Leu Asp 


Pro Arg Leu 
90 


Ala 


Phe 


Thr 


Val 


Asp 
95 


Asp 


Asn 


Pro 


Thr 


Lys 
100 


Pro Tte 


Thr Arg Gin 
105 


Arg 


Tyr 


Asp 


Gly 


Ser 
110 


Thr 


Thr 


Ala 


Met 


Glu 
115 


He Leu 


Gin Gly Arg 
120 


Asp 


Phe 


Thr 


Gly 


Lys 
125 


Val 


Val 


val 


Val 


Thr 
130 


Gly Ala 


Asn Ser Gly 
135 


He 


Gly 


Phe 


Glu 


Thr 
140 


Ala 


Lys 


Ser 


Phe 


Ala 
145 


Leu His 


Gly Ala His 
150 


Val 


He 


Leu 


Ala 


Cys 
155 


Arg 


Asn 


Met 


Ala 


Arg 
160 


Ala Ser 


Glu Ala Val 
165 


Ser 


Arg 


lie 


Leu 


Glu 
170 


Glu 


Trp 


His 


Lys 


Ala 
175 


Lys Val 


Glu Thr Met 
180 


Thr 


Leu 


Asp 


Leu 


Ala 
185 


Leu 


Leu 


Arg 


Ser 


Val 
190 


Gin His 


Phe Ala Glu 
195 


Ala 


Phe 


Lys 


Ala 


Lys 
200 


Asn 


Val 


Pro 


Leu 


His 
205 


Val Leu 


Val Cys Asn 
210 


Ala 


Ala 


Thr 


Phe 


Ala 
215 


Leu 


Pro 


Trp 


Ser 


Leu 
220 


Thr Lys 


Asp Gly Leu 
225 


Glu 


Thr 


Thr 


Phe 


Gin 
230 


Val 


Asn 


His 


Leu 


Gly 
235 


His Phe 


Tyr Leu Val 
240 


Gin 


Leu 


Leu 


Gin 


Asp 
245 


Val 


Leu 


Cys 


Arg 


Ser 
250 


Ala Pro 


Ala Arg Val 
255 
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lie Val Val Ser Ser Glu Ser His Arg Phe Thr Asp lie Asn Asp 

260 265 270 

Ser Leu Gly Lys Leu Asp Phe Ser Arg Leu Ser Pro Thr Lys Asn 

275 280 285 

Asp Tyr Trp Ala Met Leu Ala Tyr Asn Arg Ser Lys Leu Cys Asn 

290 295 300 

lie Leu Phe Ser Asn Glu Leu His Arg Arg Leu Ser Pro Arg Gly 

305 310 315 

Val Thr Ser Asn Ala Val His Pro Gly Asn Ket Met Tyr Ser Asn 

320 325 330 

He His Arg Ser Trp Trp Val Tyr Thr Leu Leu Phe Thr Leu Ala 

335 340 345 

Arg Pro Phe Thr Lys Ser Met Val Ser Asp Cys Leu Val Glu Gly 

350 355 360 

Gly His Phe 

<210> 33 

<211> 414 

<212> PRT 

<213> Homo sapien 

<220> 

<223> FOR II mRNA open reading frame 
<400> 33 

Met Ala Ala Leu Arg Tyr Ala Gly Leu Asp Asp Thr Asp Ser Glu 
15 10 15 

Asp Glu Leu Pro Pro Gly Trp Glu Glu Arg Thr Thr Lys Asp Gly 

20 25 30 

Trp Val Tyr Tyr Ala Asn His Thr Glu Glu Lys Thr Gin Trp Glu 

35 40 45 

His Pro Lys Thr Gly Lys Arg Lys Arg Val Ala Gly Asp Leu Pro 

50 55 60 

lyr Gly Trp Glu Gin Glu Thr Asp Glu Asn Gly Gin Val Phe Phe 

65 70 75 

Val Asp His He Asn Lys Arg Thr Thr Tyr Leu Asp Pro Arg Leu 

80 85 90 

Ala Phe Thr Val Asp Asp Asn Pro Thr Lys Pro Itir Thr Arg Gin 

95 100 105 

Arg Tyr Asp Gly Ser Itir Thr Ala Met Glu He Leu Gin Gly Arg 

110 115 120 

Asp Phe Thr Gly Lys Val Val Val Val Thr Gly Ala Asn Ser Gly 

125 130 135 

He Gly Phe Glu Thr Ala Lys Ser Phe Ala Leu His Gly Ala His 

140 145 150 

Val He Leu Ala Cys Arg Asn Met Ala Arg Ala Ser Glu Ala Val 

155 160 165 

Ser Arg He Leu Glu Glu Trp His Lys Ala Lys Val Glu Thr Met 

170 175 180 

Thr Leu Asp Leu Ala Leu Leu Arg Ser Val Gin His Phe Ala Glu 

185 190 195 

Ala Phe Lys Ala Lys Asn Val Pro Leu His Val Leu Val Cys Asn 

200 205 210 

Ala Ala Thr Phe Ala Leu Pro Trp Ser Leu Thr Lys Asp Gly Leu 

215 220 225 

Glu Thr Thr Phe Gin Val Asn His Leu Gly His Phe Tyr Leu Val 

230 235 240 

Gin Leu Leu Gin Asp Val Leu Cys Arg Ser Ala Pro Ala Arg Val 

245 250 255 

He Val Val Ser Ser Glu Ser His Arg Phe Thr Asp He Asn Asp 

260 265 270 

Ser Leu Gly Lys Leu Asp Phe Ser Arg Leu Ser Ala Thr Lys Asn 

275 280 285 

Asp Tyr Trp Ala Met Leu Ala lyr Asn Arg Ser Lys Leu Cys Asn 

29Q 295 300 

He Leu Phe Ser Asn Glu Leu His Arg Arg Leu Ser Pro Arg Gly 

305 310 315 
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Val Thr Ser Asn Ala Val His Pro Gly Asn Met Met Tyr Ser Asn 

320 325 330 

lie His Arg Ser Trp Trp Val Tyr Thr Leu Leu Phe Thr Leu Ala 

335 340 345 

Arg Pro Phe Thr Lys Ser Met Gin Gin Gly Ala Ala Thr thr Val 

350 355 360 

Tyr Cys Ala Ala Val Pro Glu Leu Glu Gly Leu Gly Gly Met Tyr 

365 370 375 

Phe Asn Asn Cys Cys Arg Cys Met Pro Ser Pro Glu Ala Gin Ser 

380 385 390 

Glu Glu Thr Ala Arg Thr Leu Trp Ala Leu Ser Glu Arg Leu lie 

395 400 405 

Gin Glu Arg Leu Gly Ser Gin Ser Gly 

410 

<210> 34 

<211> 189 

<212> PRT 

<213> Homo sapien 

<220> 

<223> FOR III mRNA open reading frame 
<400> 34 

Met Ala Ala Leu Arg lyr Ala Gly Leu Asp Asp Thr Asp Ser Glu 
15 10 15 

Asp Glu Leu Pro Pro Gly Trp Glu Glu Arg Thr Hhr Lys Asp Gly 

20 25 30 

Trp Val Tyr Tyr Ala Asn His Thr Glu Glu Lys Thr Gin Trp Glu 

35 40 45 

His Pro Lys Thr Gly Lys Arg Lys Arg Val Ala Gly Asp Leu Pro 

50 55 60 

Tyr Gly Trp Glu Gin Glu Thr Asp Glu Asn Gly Gin Val Phe Phe 

65 70 75 

Val Asp His lie Asn Lys Arg Thr Thr Tyr Leu Asp Pro Arg Leu 

80 85 90 

Ala Phe Thr Val Asp Asp Asn Pro Thr Lys Pro Thr Thr Arg Gin 

95 100 105 

Arg Tyr Asp Gly Ser Thr Thr Ala Met Glu lie Leu Gin Gly Arg 

110 115 120 

Asp Phe Thr Gly Lys Val Val Val Val Thr Gly Ala Asn Ser Gly 

125 .130 135 

lie Gly Phe Glu Thr Ala Lys Ser Phe Ala Leu His Gly Ala His 

140 145 150 

Val lie Leu Ala Cys Arg Asn Met Ala Arg Ala Ser Glu Ala Val 

155 160 165 

Ser Arg lie Leu Glu Glu Trp Lys Thr Lys Tyr His Pro Pro Pro 

170 175 180 

Glu Lys Cys Arg lie Lys lie Phe His 

185 

<210> 35 

<211> 36 

<212> PRT 

<213> Homo sapien 

<220> 

<223> FOR IV mRNA Open reading frame 
<400> 35 

Met Ala Ala Leu Arg Tyr Ala Gly Leu Asp Asp Thr Asp Ser Glu 
15 10 15 

Asp Glu Leu Pro Pro Gly Trp Glu Glu Arg Thr Thr Lys Asp Gly 

20 25 30 

Trp Val Tyr Tyr Ala Lys 

35 

<210> 36 

<211> 999 

<212> DMA 

<213> Homo sapien 
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<220> Exon 

<223> FOR Exon 1 and flanking introns 
<400> 36 

ccaggccctg cccctttgac gccggccgtc gcgatattgc ggagactgga tttcagcttc 60 

gtggtcggcg gagcggcccc tggagggcgc agtgcgcagg cgtgagcggt cgggccccga 120 

cgcgcgcggg tctcgtttgg agcgggagtg agttcctgag cgagtggacc cggcagcggg 180 

cgataggggg gccaggtgcc tccacagtca gccacggcag cgctgcgcta cgcggggctg 240 

gacgacacgg acagtgagga cgagccgccc cgggctggga ggagagaacc accaaggacg 300 

gctgggttta ctacgccaag taagggggcc gcagtggggc cgcggacgca cctgggaccc 360 

tgcacagccc acggacgcca cctgcgcggg gaggacgcgc actccagcgc agcgcgtgcg 420 

gtgcaaagtg aaagtaactg ttaaggagct tcagggaaaa gggtccaggg ttcccagtag 480 

gggccggccc ccttggtggg cctcgggtcc agcgggggtc acctggtggc ttcccggcgc 540 

gccctctgct gttcaggatg cagcactgcg cggcgcggcg agggcaaagc ggcctcatcc 600 

ccgccaaaaa ataaagatgt tttaaaaagc gcacatgctc agctccctcc tgcaggctct 660 

gggttgcagg gataggagtt ttgttgtgtt ttgttttgtt ttgtccagac cggtattgct 720 

cagtcatcca ggctggagtg caggggtgcc atcatagccc actgtagcct ctacctactg 780 

ggttcagaca atcctctcat ctcagcctct tgagtggctg ggactacaag cgtgcactac 840 

tatgcccgac taatttttta agtatttgta gagactaggg tcgcaccatg ttgcccaggc 900 

tggtctcgaa ctactgggct caagcaggcc acctgcctca gcctccagaa ttgagatcac 960 

aggcgtgagc cactgcgcct agccaggagt attttttag 999 
<210> 37 
<211> 1000 
<212> ENA 
<213> Homo sapien 
<220> Exon 
<223> FOR Exon lA 
<400> 37 

ccaggccctg cccctttgac gccggccgtc gcgatattgc ggagactgga tttcagcttc 60 

gtggtcggcg gagcggcccc tggagggcgc agtgcgcagg cgtgagcggt cgggccccga 120 

cgcgcgcggg tctcgtttgg agcgggagtg agttcctgag cgagtggacc cggcagcggg 180 

cgataggggg gccaggtgcc tccacagtca gccatggcag cgctgcgcta cgcggggctg 240 

gacgacacgg acagtgagga cgagctgcct ccgggctggg aggagagaac caccaaggac 300 

ggctgggttt actacgccaa gtaagggggc cgcagtgggg ccgcggacgc acctgggacc 360 

ctgcacagcc cacggacgcc acctgcgcgg ggaggacgcg cactccagcg cagcgcgtgc 420 

ggtgcaaagt gaaagtaact gttaaggagc ttcagggaaa agggtccagg gttcccagta 480 

ggggccggcc cccttggtgg gcctcgggtc cagcgggggt cacctggtgg cttcccggcg 540 

cgccctctgc tgttcaggat gcagcactgc gcggcgcggc gagggcaaag cggcctcatc 600 

cccgccaaaa aataaagatg ttttaaaaag cgcacatgct cagctccctc ctgcaggctc 660 

tgggttgcag ggataggagt tttgttgtgt tttgttttgt tttgtccaga ccggtattgc 720 

tcagtcatcc aggctggagt gcaggggtgc catcatagcc cactgtagcc tctacctact 780 

gggttcagac aatcctctca tctcagcctc ttgagtggct gggactacaa gcgtgcacta 840 

ctatgcccga ctaatttttt aagtatttgt agagactagg gtcgcaccat gttgcccagg 900 

ctggtctcga actactgggc tcaagcaggc cacctgcctc agcctccaga attgagatta 960 

caggcgtgag ccactgcgcc tagccaggag tattttttag 1000 
<210> 38 
<211> 1000 
<212> DMA 
<213> Homo sapien 
<220> Exon 
<223> FOR Exon 2 
<400> 38 

ctacaggcac gtgccactgt ccccagctaa ttttgtattt ttggtagaga cagggtttca 60* 

ccatgttggc caggatggtc tccatctcct gaccttgtga tctgctcccc tagggctccc 120 

eiaagtgctgg gattacaggt gtgagccacc gctcccgacg gcctctagat attttgagtg 180 

attcagcaaa cctcctaaag ttgaccccgt agctggggtc acagtcctct ttctccttct 240 

tccccctact tccttcttat atctggctat ctgggagaga aaaaatttaa tacaattgat 300 

tactttttag aagagttaat ttttacttat tactgtggat tttttgtttt ttaacagtca 360 

caccgaggag aagactcagt gggaacatcc aaaaactgga ciaaagaaaac gagtggcagg 420 

aggtttgtat gttgttgtct aaggatcttg gatggaagca ttaagtagat gaggaaatgt 480 

cactggcaga gaggtgacag gttattgtgt gtttagggag ggctctctgg tagagaacca 540 

gactcccact ctgaggagct tatgggattt ggcagaagaa gatgatgaga tcatagggtg 600 

gagtgaggat gatttcttac tggtcttaaa gtaccaaaat ggccagatgt ggtggctcat 660 

gcctgtaatc ccagcacttc gggaggccga ggcggatgga tcacctgagg tcaggagttt 720 

gagactagcc tggccaacat gttgaaagcg catctcttct aaaaatacaa aaattagctg 780 
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ggcatggttg catgcgcctg taattccagc tactt^ggag gctgaggcac gagaagtgct 840 

ceiaacccagg agctggaggt tgcagtgagc caagatcaca ccactgcgct ccagcctcgg 900 

tgacagagtg agatgctgtc tgaaaaatag cgacsiacaac aacaacaaaa accagaaaat 960 

aattgaggcc tttgattcac agtttatgat gttttcctat 1000 

<210> 39 

<211> 1000 

<212> DNA 

<213> Homo sapien 

<220> Exon 

<223> FOR Exon 3 

<400> 39 

acaagaggca aaaatgtgga gcccagggtg ggatcagggg ccttctgcag ctgtcgcagt 60 

tggaacatgt gacgaaagcc agttgatgtg acaactgctg ggtgggaggg acaggcttgg 120 

gggcggggct gggagggctc cttcccttcc tgacccaggg atggtcttta cttctccctg 180 

gcacctgtag acctgtcttt cttgtgtttc agatttgcca tacggatggg aacaagaaac 240 

tgatgagaac ggacaagtgt tttttgttga gtaagtgtct gcaaagaaac cactctcagc 300 

tgttttgctt tttaatagga atttttaatt ataaaagtaa tacatagtaa ctgtagaaaa 360 

atacaaagta taacagtgtg tatctgtaat cttagcagaa gtgactactt ttaacatcgc 420 

tggaactttt aacatogcta gatttattct tctattttcc ccgcacacgc atccttaaaa 480 

aaaaaaagta gggaggccag gcatggtggc tcatgcctgt aatcccagca ctttgggagg 540 

tcgaagtggg tggatcctga ggtcaagaga tcgagaccat tctggccaac atggtgaaac 600 

tccgtctcta ctaaaaatat aaaaatgagc tgggtgtggt ggcacgtgcc tgtggtccct 660 

gctactcggg agggtaaggc gggagaatca cttgaaccca ggaggcggaa gttgcagtga 720 

gctgagatcg cgccactgca ctccagcctg ggtgacagaa tgagactcct tctcaaaaaa 780 

ciaeiaaaaaaa aaaaaagagg gaatgcactg tgtggactgt ttagtaacct gctttttcca 840 

tgtaatatat tatgagcatt ttcttgtatc catatctatt tttcaaaaag atgctattta 900 

gcagctctag agttattatt ttgtacagat atactataat ttatcaaatt gcctattgga 960 

cattaatgct aaattcctgt tatagagaac ctataattat 1000 
<210> 40 
<211> 1000 
<212> DMA 
<213> Homo sapien 
<220> Exon 
<223> FXDR Exon 4 
<400> 40 

cttctctacg tctttaggcc cttgcacagg gctaagagat ttaattgaaa tctactgaaa 60 

agagtggctg acagatctta tagctacatt tacatgaatt acataaaagc ccaaaacctt 120 

ctcaagaagc cttttttgag atctaaggat acatggcaat agttattgta tgttacagcg 180 

tatgttacag gcagttctga aggaaaggat tcgatgacta tctttttttg gcacaaatgt 240 

gatccttctg gaggccagaa gatagattca gtgggcccca gttctttcag gtttaaggaa 300 

taagcatttt ggtctatgaa aaatggggtt ttcctaaagt ataagattgt cttatattta 360 

taaatgcctg tgttcattgc tgtgggttca ctgctttctc ttttgggcag ccatataaat 420 

aaaagaacca cctacttgga cccaagactg gcgtttactg tggatgataa tccgaccciag 480 

ccaaccaccc ggcaaagata cgacggcagc accactgcca tggaaattct ccagggccgg 540 

gatttcactg gcaaagtggt tgtggtcact ggagctaatt caggaatagg taggctcttc 600 

acttagttat ttacctttgg gactgctata atgagatcca cttagatcta gctataatgg 660 

aattttgttt agtggttctc tgatttaaac atgactttta tccttttcag atatcgtttc 720 

attaacatca ctacctcttt ttaaatccta atgttgtcat ggaagcctgt gtaggggctg 780 

accttgaagt ctctgaaagc tgaacactca gcaaaagact gtggctattt tggtattcag 840 

ggatgagaga caacaggctc cgtctaagag tttttgacct ggtctgcatg gtgtatgggc 900 

atattccaat tacctgggtt attcaaacca aaattgattg gtaatagaat acggggaaca 960 

acaaggtatt atgtctttgg aacaaaggat actacaggtt 1000 
<210> 41 
<211> lOOO 
<212> DMA 
<213> Homo sapien 
<220> Exon 
<223> FOR Exon 5 
<400> 41 

gtttttgcag atcttggccc ccaaaattct tagtcatagt gcccttctga tgccttcaaa 60 

cagatatggg tgtgtatgca cgtgtgttat tttgtccatc ttttctaatt gttctcagta 120 

ggaaatgggg ctaagaaacc taggtaacac tggtagaagc agaggtgctg gcgacactta 180 

ccatagagta gagcataaag tccttatcgt gggcacagga tctggtcatt attgatggtt 240 

aaatggctgg acatgccgtg aactcttcta tgtcccagtc ttcatctttg aiaatcactgc 300 
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atccctagga caagttttct caeca tcaca gtgttgacag tccgggccag atggttctat 360 

gttgtcaggg gctgccctgt tcatggteiag atgttttgta atgcctggtg tttacccact 420 

agatgccagt aggactctac cccacaactg tggccactga aaatgtctcc agacatttgc 480 

ttctgtcccc tggggatcaa aatcacccct ggttgagaac ttggggtaat ttaagtggtg 540 

ctccggtaaa ggccattcaa catgactcac tgtgttgatg ttatgttttc taacattgac 600 

tttcctttaa accatagggt tcgaaaccgc caagtctttt gccctccatg gtgcacatgt 660 

gatcttggcc tgcaggaaca tggcaagggc gagtgaagca gtgtcacgca ttttagaaga 720 

atgggtaagt gcttgactgt tgttgttttt tttaattgtc aaatacacat gccgggctaa 780 

ccatatggag atttcagtgg agtgtgtaaa gtttattgct cttggaatca tgtctttatt 840 

tttaaagtca tcttcacttt gtttgtctta tgaatgaaag catgagcaag tccattatta 900 

ttcacgcatt tgtttgtagt tcattgtcag tgtcatctgc tttattagtc aggggtttga 960 

gaatgtttct tcctttaact catttattaa gaaaaataaa 1000 
<210> 42 
<211> 1000 
<212> DMA 
<213> Homo sapien 
<220> Exon 
<223> FOR Exon 6 
<400> 42 

gcttttgaaa ttaagatatt gtccattgca tctttccatg atcgaggcgt tgtcgtgcat 60 

tcccctcctt acagcgtttt aggatgcaga ttgagaggag ataacgttag cccgaagcgc 120 

ctttgcagcc tcttttgcgc ggctgaaaca cagcacaacc cacccccccg ccccaccccc 180 

cagcctgtag gtttagcaga atcccagcct cacatcctcc ccgaacttgg cagtaaaagc 240 

cctgttcttc cat teat tec gatgatttat attctctctg ggcgtcttat attaaacagg 300 

ggaattccga catgttccat aacacattta ctgtaacttg ataccatgaa ctacaettgc 360 

tgttatttat catttctttt tattttctct cattgcagca taaagccaag gtagaaacaa 420 

tgaccctgga cctegetctg ctccgtagcg tgcagcattt tgctgaagca ttcaaggcca 480 

agaatgtgtg agtgttccag tggagggtta tagatcataa tttcttgcta ttgtaatatc 540 

tttatcagat gaacacaatt gggagaatgc aaggctgttg tgttgtcttg gcgtccaaac 600 

aggaggctca tttatattgg ccctgttaag gtgaiaccgta ttttcttgac tcaeagtcac 660 

cttcattatg agatgtgtca tcciatctaat aacagcttcc cacataccaa aagagaagac 720 

actattaaag cactagtaaa agtggctaat aaaagcttgg caatagtaag atgcatcctg 780 

attataagat tttttgtagt geatttcaga atggagtaag agtatattta aattgcattc 840 

aggaacaagt aaactcaatt atccaatatg gcagggaggt tgacaatcca ageaeccaaa 900 

aaacctctag tttctaaagc cttcgatgat ttgatgtggt acatggatgt ggttccaaaa 960 

aacatggact cacattcctt ttatttattt tttttcatcc 1000 
<210> 43 
<211> 1000 
<212> DNA 
<213> Homo sapien 
<220> Exon 
<223> FOR Exon 6 A 
<400> 43 

acatttgttt ctctccagga tctcattggc ggtttgctgg tattatattg ccatttatat 60 

tggaaaggca ggcagtgcga aaactgtagt ctattaagag tttgtgagtg tgtgtatgtg 120 

tatacagtga aataatatct agtgtaatgt gaagtgacag tcaaaattac agcttttctc 180 

ttgccgacaa gggtacttct ctgeeaagta tagtaatatt ttggtttttt tgttcaaggt 240 

tgccagcaat tctggcttac tctcttgaga gecatattag tgtgtggaag gaaggagctt 300 

cctgtcaggg gaaatactat tattttgata tatatttgtt ggtgccaaac tgcaattaaa 360 

taggaggtgt tggaagaagg ttttatttaa nccagcttga atttgttaat taattagcta 420 

aaacttattt ttggtttgct tcaagaaaca gttctatgtt gttttcataa tttaatttta 480 

cacaactgtc ttttgtttgt atcttacaga aaacaaaata ccaccctccg ccagaaaagt 540 

gcagaataaa aattttccac tagcaaaaga aggaaaaaat aaaagatett gaatagtctc 600 

atcaattaea tcttcttttg tgggtatttc ctggtctttt catgttttga ettctatctc 660 

ggtgatctga cagacatgta cgatttgcaa caacatctat tatatgattt aatttatacc 720 

tttatcagtt tgcagttatg ctttacgact cttcaggtga ctaaaganaa aagagggtgt 780 

tttaaagtgt gtgtgtttgt gtgtgcgtgt acgtgtatac atgtgttttt ctgtgtacat 840 

aagtgtgtgt gttttaaatt gtttgaaaaa cactagntcc atctctattg tattatctga 900 

gggatgctag ctcgttactt gggatagtaa caagtttctt ggtgacaact cetncctecc 960 

attgtcaagg aagactgagg atgtcaccag agttcgctgt 1000 
<210> 44 
<211> 1000 
<212> DNA 
<213> Homo sapien 
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<220> Exon 
<223> FOR Exon 7 
<400> 44 

gtggttgaac ctgagttcca actagggtga caactcttcg cagattgcct gccgctgtcc 60 

tggttttcac actgaaagtt ccgtatccta ataagcctct catcctcagc aacggaggac 120 

agttgcccac tcaaagcctt gtgacattct agggtatcct atttctacat ggtgggaatc 180 

tagaagacgg agaaagaatt tctcattccc gaaggagcat ggattatcct tggttgtagt 240 

gtttatgttc cacatcacgt ggattcccga aggagcatgg attatccttg gttgtagtgg 300 

ttatgtccac atcacatggg atattttatt tttcaggcct cttcatgtgc ttgtgtgcaa 360 

cgcagcaact tttgctctac cctggagtct caccaaagat ggcctggaga ccacctttca 420 

agtgaatcat ctggggcact tctaccttgt ccagctcctc caggatgttt tgtgccgctc 480 

agctcctgcc cgtgtcattg tggtctcctc agagtcccat cggtgggttt gaattgcata 540 

tttgttcact tatcccctct ctcataccag ctaatattcc cccaaggctc tcattctgaa 600 

aataattttc attagtcctg cttgagacat gtgggtggac tcagcttggc tcacttaatt 660 

tctccaggtc ttttttgttc gcctgcgatt gtgggggact gtttagaagg actttctaga 720 

gcaaggaaga ttgcctttac gactatactt caagctcctc attgattttc gcttacagat 780 

ggaataataa cttcatgaaa aactcaatgg catgaaccta ttattggatt tgtaattcaa 840 

caacttcaac atcttaccaa gaagaatgtg cagttattct agcaggagaa acaatgcaat 900 

tagagcctgc gagatgaaat caaattgttt tataatgaga aattagggaa ttcgaggcag 960 

acattagctg tgtaattgtg gaaagggaag aactgtagtt 1000 
<210> 45 
<211> 1000 
<212> DNA 
<213> Homo sapien 
<220> Exon 
<223> FOR Exon 8 
<400> 45 

tttttgtatt tttagtagag atggcatttt gccatgttgg ccaggctggt cttgaactcc 60 

cgacctcagg tgatccactc gtctaagact cccaaagtgc tcggattaca gatgtgagcc 120 

actgcaccca gcattcctta gatttccaat aaaaataaaa agctgtgtgg gaagtcagaa 180 

cttggttgct tcatgtcata tttcctattt ttaagattta cagatattaa cgactccttg 240 

ggaaaactgg acttcagtcg cctctctcca acaaaaaacg actattgggc gatgctggct 300 

tataacaggt ccaagctctg caacatcctc ttctccaacg agctgcaccg tcgcctctcc 360 

ccacgcgggg tcacgtcgaa cgcagtgcat cctggaaata tgatgtactc caacattcat 420 

cgcagctggt gggtgtacac actgctgttt accttggcga ggcctttcac caagcccatg 480 

gtciagagaac agcttctggc gccgcaaaca ccttgggtcc tagagaaacc tgcacacttg 540 

tgtctccacc tttttacctc ttgcgggcat gagtctggtc tcagtaataa cattgtccag 600 

cccatcataa agggctcttg aacacatttt catcaacttt aggttciagtc tgtttgggta 660 

aatgcgtctt ggagggctgg gtagaagatg tgggtttcag tatcatgtta agtatggctg 720 

aaagtcctta tggaaatggt gatttttttg tttgtttggt tttgtttttt ttggggtttt 780 

ttattcagaa actttgaaaa tctattttgt tgaatggagc acttgaaaac tgctgttttg 840 

tgtcagtagg taaacaacaa acattggtga ctactgaatt ttcagcagat gtgattcctt 900 

tgtttcacag aaaaactgga tcttttgttc taeiatttttt tcttctaatg ggtataatcc 960 

tctgttggag agtcctttga tagctaggag tgtgttttct 1000 
<210> 46 
<211> 1000 
<212> CNA 
<213> Homo sapien 
<220> Exon 
<223> FOR Exon 9 
<400> 46 

tagaccccct ttgataatct ccactaagca gacatactcg atacatcttc actaatgagt 60 

tctgacttca taaaaagtat taatgacttc tttttgaaag taagagtgct ttgaatacca 120 

gtcgttattg ctttagaagt tcataaaagc aaaagcacag tatttccccc agtgtttgtg 180 

cgataagaga atagaatgta ggtcccagcg ccttagaatt ttaagctatg ccttctcttg 240 

gtttgtgaat ttccaggttt cagactgcct ggtagaagga ggtcacttct gattgtcagt 300 

gactttggtg agttcttacc ttgtaaaaga tttacaatta tttcattttc aacatagctt 360 

tatcttatga caaaggtgac agaaaggaaa tctcctaagt tggcctacag ggtgctttag 420 

aaaacatctg gctgggcatg gtggttcaca cctgtaatct ccacactttg ggaggctgaa 480 

gtaggctgaa gtgggaggat ggttagagcc taggagttcg agaccagtct gggcaacaac 540 

gtgagatcct gtctctacaa aaaataaaaa aaattatctg ggtatagtgg tgtgcacctg 600 

aagtcccagc taactgggag tctgaggcaa ggaaattgtt tgagcctagg aggttgagag 660 

tgcagtgagc cgtgttgctg ccactgtact ccagcctggg caacaggaca agaccgtgtc 720 

tccaaaagga aaaaaataat ciaagcactgt ctctctctac ccttgcagta tccctgtagg 780 
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agagagttac tattagctcc cagtttatag gtgagtgata tggtttggat gtgtccccac 840 

ccaaatctca acttgaattg tatctgccag aattcccaca tgttgtggga gggacccagg 900 

gggaggtaat tgaatcatgg ggcccagcct ttcccatgct attctcataa tagtgaataa 960 

gtctcatgag atctgatggg tgtatcagga gtttccgctt 1000 

<210> 47 

<211> 1500 

<212> DNA 

<213> Hano sapien 

<220> Exon 

<223> FOR Exon 9A 

<400> 47 

aagtcgaggg gttggatgaa tttgttcttg cacgttcaga aggataccat ctttttctct 60 

gtgtggaaga ggcgcctgcc acgacgtatt gatatgtgta tttattttcc aagcctgtcc 120 

ctgtatgagg tgtcaaaagt tacaccagct ttacaagcgg agtttatgaa ctcgtttttc 180 

caggatagtc acattatact ttttacagtc atgtgctttc agcccagtac cctttgctat 240 

gccaagatcc agctgaaact gaaccaggtg ggggaggcct gctaatgccc aggcagtcga 300 

aatgacgcca tctcatcact ccttctctta eiaattttttt ttgtctttct tcttggactt 360 

ccagcaacag ggagctgcca ccaccgtgta ctgtgctgct gtcccagaac tggagggtct 420 

gggagggatg tacttcaaca actgctgccg ctgcatgccc tcaccagaag ctcagagcga 480 

agagacggcc cggaccctgt gggcgctcag cgagaggctg atccaagaac ggcttggcag 540 

ccagtccggc taagtggagc tcagagcgga tgggcacaca cacccgccct gtgtgtgtcc 600 

cctcacgcaa gtgccagggc tgggcccctt ccaaatgtcc ctccaacaca gatccgcaag 660 

agtaaaggaa ataagagcat tcacaacaga gtgaaaaatc ttaagtacca atgggaagca 720 

gggaattcct ggggtaaagt atcacttttc tggggctggg ctaggcatag gtctctttgc 780 

tttctggtgg tggcctgttt gaaagtaaaa acctggttgg cgtgtaggtt ccgtatctcc 840 

ctggagaagc accagcaatt ctctttcttt tactgttata gaatagcctg aggtcccctc 900 

gtcccatcca gctaccacca cggccaccac tgcagccagg ggctggcctt ctcctactta 960 

gggaagaaaa agcaagtgtt cactgctcct tgctgcattg atccaggaga taattgtttc 1020 

attcatcctg accaagactg agccagctta gcaactgctg gggagacaaa tctcagaacc 1080 

ttgtcccagc cagtgaggat gacagtgaca cccagaggga gtagaatacg cagaactacc 1140 

aggtggcaaa gtacttgtca tagactcctt tgctaatgct atacaaaaaa ttctttagag 1200 

attataacaa atttttcaaa tcattcctta gataccttga aaggcaggaa gggaagcgta 1260 

tatacttaag aatacacagg atattttggg gggcagagaa taaaacgtta gttaatccct 1320 

ttgtctgtca atcacagtct cagttctctt gctttcacat tgtacctaaa cctcctgctg 1380 

tgcctcgcat cctacgctta ataaaagaac atgcttgaat atcatcacct gaagtttgta 1440 

ttgtttcttt aaatgtttgt ttcagtttgt ttttgttttt cattttttag aaaagaaatc 1500 
<210> 48 
<211> 595 
<212> DMA 
<213> Homo sapien 
<220> Exon 
<223> FOR Exon 10 
<400> 48 

ctttttctca agtgttgaaa taaattaatg gtgtggccat ggttctcgta cttcaggtcc 60 

ctgtgagccc ctggggtcct acacaacttg gggtaatgct atggtcacct gcatcccctc 120 

ctctccagcc ctcacctggt tttctcttct ccttccaaga aaaaacaccc ataccattct 180 

taacatctct ggaagctggg ggtcaggaga gggagatcca taaagttctt gtcttccccg 240 

actcaagaag cttaagggta cattcactca ctcatgaatt gactggttta ttcattcact 300 

ggttgattaa ttcattcatt cactggttaa ttaattaatt catgtcatcc tttggatgtc 360 

ttgcagagct gagtgctgaa ataaaatgat aaiacaagtca aaaacaaaaa ggcctctgac 420 

ttaacaggac ttccgtgatg cgggggaaga agaceiatagg caagtaagca agtaaatata 480 

caagataaat agaaactgtg atgagggagt taaagaaatt aaactggtcc ttgtggaagc 540 

aacttcgggg gaggaggaaa gacagggaag accattttaa ggaggtgatg tttga 595 

<210> 49 

<211> 959 

<212> DNA 

<213> Homo sapien 

<220> Exon 

<223> FOR Exon lOA 

<400> 49 

aacagaaaaa catgccatca tctttaattt cctgcatcag ctaagcattt atttcatgta 60 

gtcctcacac tagattgagt ggttccatat taatatccta cgtcaaagat gagtaaattg 120 

agatgcgtag gttctaattt gcccagtgtc gtggtgtgtg atcttgacca ctagcctaaa 180 

ctgctatttt atgtccaccc atcaacctct acggctgccc ctcatttgaa cacacaaaat 240 
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atagttgtgc ggcttggtgg agccatctaa attccg1:tag gccatttgcc aatgctgcta 300 

ttaggggcga aatgtcatgc gcttgatcta aatgtactta ggagaattct caggacctga 360 

tgaattatta ttcggaattt tatggcctca caggttgcag gcttcatacc aactgcagct 420 

aatgagctat gggccccgag aaacactgag gacacacggc gttctgcaca cagagtgggc 480 

tgtttctgtc tgttctcccc ctgcaccctt ctcagatgca atctcaagtc ataggagaac 540 

ttgtgcaaat gtttctcctg gatggtttcc tttagagcat gtgtcctata acttgaeiatg 600 

gttgtctgag cagaatgttt ttagaagtta gattttttta ggggggaaac aggaaccaaa 660 

gcaaagccaa tagagatctt gaaaiaaaaaa aaaaaaagaa aaaaccaccg tggtaCCcCa 720 

ggaagaaaaa agcatttttc gaatgaaaac tttttattat atttgatata ttctgcttcc 780 

cttccctagt atgtattaat gagatgaaat cacttcttaa ttttcaggtt aatattaaag 840 

ttgaagccca tccctctacc ctgaggactc tgccagcctc tggcagtatt cctttccaac 900 

ttccacttgc cccaaatagg tagaagttag cctttatttt tggtgtcatg tcttctttc 959 

<210> 50 

<211> 921 

<212> DMA 

<213> Homo sapien 

<220> Exon 

<223> FOR Exon lOB 

<400> 50 

tttctccccg acatgcccct gccaacctcc ccccatcatt ccagctggga gaggccatca 60 

aatgtagctt ggagagctat agaaaccttg agctctgagc ccaaggagca agttggagtc 120 

ctgcctccag ctgtgtagct cagataacag ctggagcaga gaacatgctg ttctctctgc 180 

tagaatatct gacccaattc tgactagtaa agagagttaa tagtttgtac ctcaagtcat 240 

tcttgtccct gtgtgaagga agagaagcag tcatttccct ccacctccaa cacacactcc 300 

gtccccactc tgtctctctt ggctgttttt ctcttcattg ttgttaaact cacgttcatt 360 

tctcCtgtag tcagggaaat gaagagccac tttcaacaat tctgaagaag aaatatggga 420 

tgttttgctt cgggatggag gctggagcag gagtttctgg agtctgccag tacccagttt 480 

gaatcccagc tctgcctttt atcagctggg tgttgggcaa gttagctgac ttttgtgagt 540 

tttctcatca ttaaaatgag aacactgtta ttggtctttc gggattgttt tgagaaatga 600 

gatatcgaga catgcctggc acaaggcctt aattcttctt catggtcaag aaatggcaga 660 

ttttccccct tccattccca cccttgcaca tagtaggttc tcagcaagta tttgtagatg 720 

taatcgacca gcagagatca tttgtaccct taacacccac agagagtcac agatgctttc 780 

actgaaggag ggtgtcccaa gactcaatgg cagggaataa aaatgccaag tcatgtaagt 840 

attccacaaa gttagagggg aggagtaagt acctcttatt cgtgcatctt tatggtatga 900 

ccaagggctc atgatttgta a 921 
<210> 51 
<211> 15004 
<212> DMA 
<213> Homo sapien 
<220> 

<223> DMA region encoinpassing exon 6 
<400> 51 

gatgggcgtt tattatgaga tacacgaaga cagagaccag aggttcccca agtagcccct 60 

ggacctgcag tgtcagcatc acctggtagc ttgttacaca taaaaattct tgggctttat 120 

tccacacata ataaatcaiaa agcttgtggg gtggggcctg cagtctgagg tttatttttt 180 

cgaaaactgt aagttccagg gtacatgtga aaaatgtgca agtttgaaaa ataggtaatc 240 

atctgccacg gtggtcatgc tgcaaagatc aactcatcac ctggatatga agcccagcgt 300 

ccagtacctg ttgttacaga tgctctaccc accacatgca agccccagtg tgtgttatcc 360 

gcctccaccg tgtgtcaatg tgttctcatc atgcagctcc catttataaa cgagagcata 420 

cagtgtttgg ttttcagttt ctgcataagt tcgcttagga taacggcctc cagctccatc 480 

catgtccctg cagaggacat gatctctttc ctttttatga ttatataata ttccatggta 540 

tatatgtacc atattttctt catcctatca tttatgagca tttggagtga ttccgtgtcc 600 

ttgctattgt gaaataccat ttgacccagc aatcccatta ctgggtatgt aaccaaagga 660 

atagaaatga ttgtattata aaggtacatg cacatgtatg ttgattgcag cagttttcac 720 

agtagaaaag acatggaatt cacccaatcc gaattttaac aagtcttcca tgtgattctg 780 

agacagactc atgttcgtgt ggttctggct ctgcagaact gaggtgggcc ccaaaacttg 840 

cgtttctaaa aggtactagc tgagcatgat gactcacatt tgtaatccca ccactttgtg 900 

aggccgaggc gggtggatcg cttgaggcca ggagtttgag accagccttg ccaacatggt 960 

gaaactccat ctctactaaa aatataaaaa tttgccgggt gtagtggtgc acacctgtag 1020 

tcccagctat ttggaaggct ggggcaggag aatcgcttga acctgggagg cagatgttgc 1080 

agtgagctga gatcaggcca ctgcactcca acctgggtga cagagccaga gtctgtctca 1140 

aaaaacgaaa aacaaaaaaa accaaaagac taaatgtata aaaggtaccc aggtgatgtt 1200 

gatgctactt gtcttggcca atgaaatgaa aagcctacag gccaggcatg gtggctcatg 1260 

cctgtgatcc acaactttgg gaggctgagg caggcagatc acctgaagtc aagaattcaa 1320 
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aaccagcctg accaacaCgg tgaaacccca tccctactaa aaatacaaaa ataacaccaa 1380 

aaaaaaaaaa aaaccacaac aaaCCagcca ggtgtggtga tgtggcctat agtcccagct 1440 

atttgggagg ctgaggcagg agaatctctt gaatccagga ggtgaacgtt gcagtgagcc 1500 

aagatctcac cactgcactc tagcctgaac ttcagagaga gactatgtct caaaaaaaaa 1560 

aaaaaaaaaa agcctacagt tgacaggcag ataacagagg gagcaaagga gtcttgtgga 1620 

agacaatagg gagtgttgag gacctgtcaa acaacacagt ccccatctgt actttgggag 1680 

ggaaggactt cttaattcca gctgcctgtt gccatggggg aacatagacc cagaactgcc 1740 

atcctttcta atttttttiaa aaattaaaag gctaaaagtt ttggcttata aaaatggtat 1800 

ttcctgagtt tttacaaCta tagtattaag atattcctag gtttttgttt gttttcgaga 1860 

cgaagtttcg ctctcatttc ccaggatgga gtgcagtggt gcaatctcct ccgcctccca 1920 

gtttaagcca ttctcctgcc tcagcctcct gagtagctgg gattacaggc gcccgccacc 1980 

atgcccggct aattttttgt attttttagt acagacgggg tttcagcatg ttggccaggc 2040 

tggtctcaaa ctcctgacct cacgtgatcc acccgccttg gcctcctaaa gggttgggat 2100 

tacaggtgtc agctagtgca cctgacccta ggataatctt aattttaagt aactggtaac 2160 

tcataaattt ttaaacactc tgcagttcaa agaaaacatg tttgtggtag gtagccagtt 2220 

tataacctgt gtaagactgt gttattagca aatccctcaa tgtgttgacc aagagtgggt 2280 

aggaatcctg cagacagaat cttttgaagg tggtttgttt ttttattcct ttataagtac 2340 

agttgtagat ttacaaagta attgagtata tagtacataa ggtcttgtgt acccaccgat 2400 

cgcaattgag catatagtac atgctgtcct gtgtacccac tgatccctgg tcctgcccct 2460 

ggtatgaggg cattaacatc ttgcattagt gtggcacatt tgctacaatt aataaaccaa 2520 

tatcgataca ttattaacta gtctatagtt tacatgaggg gctgctgctt gtgttgtaca 2580 

ttctataggt ttgttcaatg cataatgaca ggtgtctgtc atttcattat ggagtgaaac 2640 

cattccaatt tcatacagcg tagtttcacg cctctaaaag tcccctgaaa gtgttattaa 2700 

aagtactatc tggccatgtc ctcatttccc cagcaaagtc cttctgagaa tttaggcgct 2760 

ttaggctaga ggatctcagt gctcagaaaa gcggtattgc attgtgattt cgatttggat 2820 

tttcttgatg gcttaggata ttgaacatct cttcatatga ttattggatg gatgattttt 2880 

ttcagcggtt atcaggtcac acctgtaatc atgcccacct cattttctta cagtgcagtg 2940 

Cgtgctttta ggcacatacc aagagcttac agagatcctt ctggttcaga ggaaagattc 3000 

acatccccaa caaacactgc tccccgatta caatatagag aggctaaatt atagtccata 3060 

tcacttttcc tttaaatccc tttacttttt tgtgctacaa gagtttcaga ataataattc 3120 

tgcagaaaaa agtactgatt gtccagcagt tgtgtcaggt agaaaactgg gatgcccgac 3180 

tgcattggtt tagccagaac tgtggccatg aaaccttgcc tggcccccag ggatggggtg 3240 

aggggggata tcatatacgc tttaggtggg gcagaggggt ggtctgcccc tgtgcaggca 3300 

ggaatgttca ggaagcaggc tgcctctgtt tctactctaa atatttaaca ttcacggatg 3360 

gcatggatcc tgactaagcg gagcagatgc tggctctgta ttgcacatag ctcctgaata 3420 

tcagccctgc ttggctatgc atgtgttttt tgttagtttt Cgagacagag tctcggtctc 3480 

ttgcccaggc tggaatgcag tggcacgatg ttggctcact gcaaaccctg cctcccaggt 3540 

tcaagcgatt ctcctctccc agcctcccga gtctctggga ttacagacgt gcatcaccag 3600 

gcccagctaa tttttatatt tttactcgag acgtagtttc gccatgttgg ccaggctggt 3660 

ctcaaactcg tgaccttagg tgatatgcgc ccctcagctt cctaaagtgc tgggattaga 3720 

ggcatgagcc accagacatg acctgtgttt atattgttaa atcaccattg tcaatcttga 3780 

aaggagactt tgttttgatg tatgtaagat tgagttttta ctcatctcta catagtttcc 3840 

tcagattgct tttttttttt tttgagaccg agtctagctc tgtcacccag gctggagtgc 3900 

agtggcacaa tctttgctca ctgcaacctc tgcctcccat gtttcagtga ttctcttgct 3960 

tcagattccc gtgtacaggc atgccaccac acctggctaa tttttgtatt tttagtagag 4020 

atggcatttt gccatgttgg ccaggctggt cttgaactcc cgacctcagg tgatccactc 4080 

gtctaagact cccaaagtgc tcggattaca gatgtgagcc actgcaccca gcattcctta 4140 

gatttccaat aaaaataaaa agctgtgtgg gaagtcagaa cttggttgct tcatgtcata 4200 

tctcctattt ttaagattta cagatattaa cgactccttg ggaaaactgg acttcagtcg 4260 

cctctctcca acaaaaaacg accattgggc gatgctggct tataacaggt ccaagctctg 4320 

caacatcctc ttctccaacg agctgcaccg tcgcctctcc ccacgcgggg tcacgtcgaa 4380 

cgcagtgcat cctggaaata tgatgtactc caacattcat cgcagctggt gggtgtacac 4440 

actgctgttt accttggcga ggcctttcac caagtccatg gtaagagaac agcttctggc 4500 

gccgcaaaca ccttgggtcc tagagaaacc tgcacacttg tgtctccacc tttttacctc 4560 

ttgcgggcat gagtctggtc tcagtaataa cattgtccag cccatcataa agggctcttg 4620 

aacacatttt catcaacttt aggttaagtc tgtttgggta aatgcgtctt ggagggctgg 4680 

gtagaagatg tgggtttcag tatcatgtta agtatggctg aaagtcctta tggaaatggt 4740 

gatttttctg tttgtttggt tttgtttttt ttggggtttt ttattcagaa actttgaaaa 4800 

tctattttgt tgciatggagc acttgaaaac tgctgttttg tgtcagtagg taeiacaacaa 4860 

acattggtga ctactgciatt ttcagcagat gtgattcctt tgtttcacag aaaaactgga 4920 

tcttttgttc tciaatttttt tcttctaatg ggtataatcc tctgttggag agtcctttga 4980 

tagctaggag tgtgttttct tctttacttt cccaaagaca attttggatg aatcatggta 5040 

ctgtggttac atttggaagt gtttacaaag gtgataagat gtttttatag tttggtgttc 5100 

atttatcgac catattciaga accttcttct atgaaatggt tttagtagga agtattttga 5160 
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atgaagaaag cgtattttcc acaatatatt tggtagatta ttttccaaaa ccaaaggact 5220 

tcaaaaatgt cttcctatta gtggacacca gtattccagt tacccaaact taaaatctac 5280 

ttgacagaaa tcaccctttt cccaaacaca cttgcctttt agcgatatca caggccttca 5340 

cagtcggacc taggatatta ataaaacaaa gcaaaacaac aacaaaagaa gaactattgc 5400 

aggcctatta tcctccagct gatctcacga ccttggaaac tgtctgctgc cttctcttac 5460 

actcattccc acctgccacc cttaaccttc gtatttgggg atgacttttt tttttttttt 5520 

tttttttttt tttttgagac ggagtctcgc tctgtcgccc aggctggagt gcagtggcgg 5580 

gatctcggct cactgcaagc tccgcctccc gggttcacgc cattctcctg cctcagcctc 5640 

ccaagtagct gggactacag gcgcccacca ctatgcccgg ctaatttttt gtatttttag 5700 

tagagacggg gtttcaccgt tttagccggg atggtctcaa tctcctgacc tcgtgatccg 5760 

cccgcctcgg cctcccaaag tgctgggatt acaggcgtga gccaccgcgc ccggcctggg 5820 

gatgactttt aaaaccacat caaaagtctt ttgaagtgct taggaggagg taagtaagta 5880 

tatcttttat tctaatcacc caagtggttt ttaggcagag taaacagaca tattccaaga 5940 

gtcgaatact taggatcagt ggcagtgaag tgaaaggacc tacggagaat ggtgttccat 6000 

gccgcagggt gggattgatc tctaggaagt aaatgaatga aacttacctg taggcggatc 6060 

acagggtttc gaaatctccc catcactctg ctcccgatga ctgcataagc gcagcctctt 6120 

catgaaagtt tctttgacca tggttatgcc tcatactttg ttgtgtcctt cagtttacat 6180 

gaaaagaagg aatatggatc tgaactgggt gccagtgcca tttaaatgag gcatttagga 6240 

atggggagag gagagaggtg gattcttcct tattttctgc tggcccagtg ccttccgctt 6300 

taacaagccc agtcaaccat catttgtaat ctttgggtct tgagtcatct cacttgtatt 6360 

tgatcattga tgtgttttaa ggagctcggt tcagtggcat cagagagttc gttttctcct 6420 

tcttccataa ggtccctaat cagaggaacg ggatgggaaa gctttagaga tgagtctgtg 6480 

actatggaat ttggtgaggg gtgatataac catcctgcca tttgttgtag actcgaatgg 6540 

atgaggagtc attgatagaa aatagtggaa actgttattc tgtggcctga tcccattgct 6600 

tagggactct gtgccagggc atgtgggaaa agaaggaagg caagttctgc taaaatataa 6660 

agtcagggta aacattttta tttattggag aagagttgat tattatctaa tatttaatat 6720 

aatatgctta ataaagtcag acttgaaagk tggctggttg gacctaacgg gagtatctct 6780 

aaaaaatacc agcaaattaa gttggatcgg tgactgtatc atgatatcgg agtgttagct 6840 

gttttgggag tacgtgtgct caggacttgg gctgagtgga aaaccagagt atggggttag 6900 

aaatggaggc agagggacta ccatgtcggg gaggatgcac aatgaattgg acgaaggttg 6960 

caactcagtg agcaccgtac ctgtagtgtg tcttagtaat agcagatgcc caacctccat 7020 

gcctaactcc tttgcctacc ccaccagtgt gcagcagcct ctgccgcacc ctactgcagt 7080 

etc tec tat t tecctggcag gagagetcag agactacttg tgaacaggcc tcaatgccta 7140 

tagcaaggca aggcttggga gaaacaaatg tcctctggga gcagccttgg gtaaaagatt 7200 

aactgcgcac tgagctttag atccctgagt gagaaagctg tgggaaaatg agtgtgtagc 7260 

tcatgagagg gtttggagaa aagtggtgac atcgcaaaaa tcccaggttt tgttgaagtt 7320 

gtetgtggag gggcactcat agaaagggca ctggacgtga gcatttecet tgtagttgag 7380 

atgggatttg acatgacttc ctggtggtgg caagacaaga aagtggcaac ctgggacagt 7440 

ggacctcaca gctctcctgc taatggtgcc gatggctaag ctcagctctt acccttgaec 7500 

actggaacaa cttagacatg tcaceagctt tctgagcctg tctcattacg atgaaaattg 7560 

tgatacaggt tagtgtaagg accgaeaata atgcatataa eatcaataat gatggatata 7620 

gaatttctgc agttattata atctggctct ccagtatggt agtcgtgggc cacatgtggc 7680 

tattgagttc taaaaatgtg gctaatagaa ctgaggaagt gaattttcaa tttgattcca 7740 

tgtttgttaa ttcgaactga aaaacaceac gtgtgccaag tggctgccat actggagagc 7800 

gtggtcaaat gacatgccta gctcagtgcc tttegcatgg gaggtatgca gtacttectg 7860 

ccacgatggc tgttactctt gtcgttgtga gcagtgcctt ggtccegatg atgattetec 7920 

aaaatgtaat ctetgctgac agaacaagag tctgcaaatt gggagtctgg aaagggaaga 7980 

gaagggcttt gtccctggcc ccaaagactc agggagaggt ttactggcca ggtagaaagg 8040 

gcgtccaggt gaagggaccc tggttcagtg atctccaacc tgcacgtctc atttgtgagg 8100 

aagccgtagg gtgtggactt ttcttgttte tgaccttgag tgattagaaa atagcagctt 8160 

ttatatgtca caaatggact ttagataagc atgaagatga cgaagactgt ggaatatgta 8220 

getaagagtc ttactaaaat cetctctagt gtatttatat atttaagcgt ttggtagtac 8280 

ttttttagcc ateagcgtte tctatattag gttgatacag aagttattga ggtttttgcc 8340 

attgaaagta atacgtcact tcatgcagag gtggaattgg cgctctgagc ttgcttgtca 8400 

aattcacagg cattcacttt tetttcataa aeatttagga ctctgtattt aggagaggaa 8460 

tcactgtgaa tgtagaccte gggttggtca ttttcctagt aagatagacc agactgaetg 8520 

gciaaagtcac cctcttggge tatcctgcgt atgtggctcg gcattgcttc cgtcaggata 8580 

atgcatcagt ttcttggaca aagggtgcat ttctgctatt tgaatgcaga catatttttg 8640 

gatataagcc acgcagttgc atcceattte ttccgtaggt aaaggctggg tttctaattg 8700 

tgattgagaa gcttagcata gatgcaatgt ccctatcacc agatggctag tgtgctctgc 8760 

cttgtctgcc ttttacctta gagagggtgc ttccttctga cacgggtatt gctggagtac 8820 

aeattctgtt gcatgaggtc aggggagcaa gaaatacaac ceagacttgc getcggaagc 8880 

cctgccttca tttcttctct gtagctggct cccttaagta ttaaacagca gtattctaat 8940 

atcaacgtce cttttttatt tgctgtgaet tgctgtgact acggtgattt tagttacagt 9000 
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aattaggtca ctcttcatgg agtcgggcta ttttaagtgc ctctttaagg aatattgtCg 9060 

cagatcagta attttacaat tggatgttct tgtgcaactt tattaaatgt gatctgtcct 9120 

ttaaatgttg cccttttaca gttacacatt tctaattgtt gctgtaaatg cattatcact 9180 

ttgaaattaa ttaatttttc cattttggct gcctgcaata taataatcac taatgagaaa 9240 

cccacaggaa tcaattctca gcacagcggc ccctactggt aaagccatgt ctctgcctcc 9300 

ctcttcttct tcagatccag agaactggac aagtgtattg gggcgtcctg ggagggaagt 9360 

gtttgtttct tcctcccttc aatctatggt gaaataaaaa tcacagataa acttggaaag 9420 

tgagttattt ttaagtgtcg ttctaaaaaa gatgcaatat acttcaeiata gctgcttcta 9480 

aaatatgtta aaggaaacta acatttgctg agtactttct aataatgaag tcacatcata 9540 

cgtgcattac atgtttattt tcttgactct agagcattct tttatgcagt caaattactc 9600 

taaaatgttt tgcacctgca cgcattaatg tgtattgttt ggtaagaact ttcgagttgg 9660 

tgaaaaaatg tattgtattt tagtatctat atttatatat acatattagt ctatgggttt 9720 

taattcaccia aaccaggtgt tgtatctggt gcacataagg gattttcaag gctaatcttg 9780 

ggtttttgca gttgtaacca tgtgtgatga ctttagaatc ctaccacact tccaaataga 9840 

gatgcagatt gaccattttc ccagatacca cgatgcttta tgtatcagca tatattttca 9900 

ttaatttctc ataatttctg ctagaattaa taattccaag ttctacaatc ttagaattat 9960 

tatctttatt ttgtaattga aaagcacaga aacctcatgc agcgtgttca agaaaatgat 10020 

ataccaggct tgtaaacttt cacttgctaa ttgcagctgc atttgctttg tccaatgcac 10080 

ggtaatgccc ctggcatcca tatctagtta tgcagttatg cttccttttt aaaatgtttg 10140 

acagcacctt cactgaaatg aatgctgcca ccaccgccaa ccccaccttt tttttttcca 10200 

cataagcttt catttctggt aaatacatcc ttgccagagt tcccaggtcc aagccggtcc 10260 

tttgtggacc agtatgcaga tttgaagttg agcactgtca cttgcctctt aggagtgttt 10320 

tgtgcaacca catatttgtg ttggcagata acagtagaaa gaatatcatg tgttagggta 10380 

gcaagagctt agcagatgtt ttagggaact gcattttcaa atcattttac attaaaaaat 10440 

ggcttacgtt tatggaactt ctgcgacgac cgggttacgt ttcctgtgcc cgtgtaaagc 10500 

cagtcatcgt gctgagtttt aacggatgtg cagctaccct gctaatggat gggccggcaa 10560 

tgctggtcct gttgcaaagt tataaattat aactagaaaa agcacaagat gtcattcttc 10620 

taaccttcgt gaciagccaaa cggggtaatt ctgcccttga ataacccgct gcagggtgat 10680 

tcatagttgt acaatttaaa acgtciatgaa ggagataata gatgttgatt tgttcctgaa 10740 

attagtattt tagaaaaata cttctccccc tttccatctt taaatacctg cagctgacta 10800 

cactatagat actatcaagg gcaaagcccc ctgtttgaat gtgtattggt acattggtgt 10860 

atcacataca aggcagaggg tgtatsiaciac aggctttctg ttgcgagggg gaaatatggt 10920 

tttcaaagtg gaaatttcat tgtattcttc atgaatctaa acattttgag actcctgagt 10980 

gaggtttgaa ttttctggtt gcctttctct tgccagtcca gatatgttag aaaactggac 11040 

tctcaggacc cctctgtggt tggaccacac tggcgctttc ctgcagttgt ctgtcaacag 11100 

aaggacctgg tagctggctt gtcctcttac accactgagg ccaccttctt gcaggaagaa 11160 

acatacttct tcctgccctg gattaacatt accattggtt ttagtagtct tacgCcatta 11220 

tgtccaaagc ttttaagttt acagtgcttg cttagtccag tgaccaatgg atgtaagtaa 11280 

gttggciciaag ggcaaaggat ttagaatcat gagggttgta ttcaaactgc agcctctact 11340 

cccttttagt tacctaattt tgaaatcatt tgacatttct gaggttaatt ttctacatca 11400 

gtttagctgc tttaggccgc aagttattaa cacccactta aaggtggctg aaataataga 11460 

ggttttgctt cccatgtatc cagagttgct gaggcagcaa gggttgtgtt tggctactat 11520 

atgagtagct cctttgtgat aatctttact tttccctcat gcttacaaga tggcttctgt 11580 

ggtgccagcc attgcatcct atttgataag gtctgaaggc acagatggga gggatgaggc 11640 

tcctcattgc atgccttttt cctttatcaa gagacagagc attctgcttc agaactgtcc 11700 

tcctgagttg acttcccgtc agatcccact ggttgggact gagtcatctg tttgtatcct 11760 

agttataaag gagactagaa aagtgcataa ctggcttttt gagcctctag agtggcaggc 11820 

gaactctgat tttgaggaaa aaggataggg taatggtgtt gacacaggca ctgttgaata 11880 

cgtatgtaca tattttaata gttttctcat taatcaaatg gaaatgctat cgattttata 11940 

ggatcgttgt gagaagaagt gagaattgaa atgaaaggca tctagtcttc cccggtcccc 12000 

atccactcga cccaccatac ctcaagccac ttctggctcc atgctctttc tttccttagg 12060 

gttttgaaac atgttcttca gcctagaata c tec tec tec actettaatt tctaeccate 12120 

ctgcaaaaet cagtctgaac agcacacgtc aaggaagcag ttttggtctc tcagattata 12180 

tgaagctctc csuitatgtte tttcagaaca gccagtgcct agaatactgc ctgactcgta 12240 

gtagcactcc ttatgtgttg aatgagtaaa tgaagagtcc ctggaatttc ttcttcataa 12300 

eattgaaeae aactgcagtt tgaaaacata atttctgtag tttttttttt ttcttttttt 12360 

ttgagacgga gtgtctetcc gttgcccgga ttggagtgta gtggcacaat cccggctcac 12420 

tgcaacctcc gcgcctcccc agttcaagca attctcctgc ctcagcctcc caagtagctg 12480 

ggattacaag cactggccac caaacccagc taatttatat atttttagta gaageggggt 12540 

ttcaacatgt tggccagget ggtctcaaac tcctgaccte aggtgatcca cctgecttgg 12600 

ccttccaaag tgtggtggga ttccaggcat gagccactgt gcetggcctg tagtttettt 12660 

ttcttttttc ttttcttttc ttttttttga gatgaagtet tgetctgtca cetaagctgg 12720 

aatgcagtgg tgcaatcttg gctactgcag cccccgcctc eaagttcaag cgattctcct 12780 

gtgtcagcct cctgagtagc tgaaatacag gtgtgeacea ccacgcacgg ctgatgtttg 12840 
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tatttttagt agagacggga tttcaacatg ttggccaagc tcgtctcaaa cctgcctcag 12900 

cctcccaaag tgctgggatt acagatgtga gccactgcac ccagcctgta gtttcttatt 12960 

taatgtctgt ccccccttct cttgacagtg gtgtttttgc cttcacacta ctgtatccct 13020 

ggtccatacc gtaaatgttc tgtaaatgtt tgtaaacgaa tacatgaaag atgaccaggt 13080 

gctgaatcag tgagagtcct taatgtactt cttatcattt atccctctga ggtaggcaag 13140 

ggattatagt ttagaatgtg taaatacgta aaagataagg caaggagacc tgctgagagt 13200 

cacacagcaa gcccccaagg tcccccaggc tcatggtcaa atgacatgta agggacgtat 13260 

ccagcggtgt gggtatgttt ggagggaatg aatgtaagga gagagaggtg aggcacagga 13320 

gagacaggat gagctttaga ggtagacagc tctcgggtcc agccctagct ctctcccatt 13380 

ctciactcttg ggcaaggttc aaaacctgat taagattcat gcataaaact gaga tgt tag 13440 

aactacctca tgggcttgtc ttgagaatta aagacgttat gaacttaagg cctctacagt 13500 

gtgctagatg cttaagaaat gttaactact attgttattg gcttgcagta gaaattaaag 13560 

attatgaatg ttcagcattc agaatatact agatactcaa taaccattat tgctgtcatt 13620 

attaacatta ttttccccag ctttactgag gtatgagtga caaaaatttc atatatgcgc 13680 

atgagtgtgt gtgtgtgtgt gtgtgtgtgt gtgtggctgg gcatggtggc tcttgcctat 13740 

aatcctaaca gttgaggagg ccaagctggg aggatgactt gagcctggga gttggaggct 13800 

gcagtgagct atgatggtgc ccctgcactc catcctgggt gacatagtga gaccctatct 13860 

ctaaaggaaa aaaaaaatta tatgtattcg tggtgtgaiaa tgtgatgttt tga tat teat 13920 

atgcattgtg aaatgactac cacaatcaag ctaattaaca catcggtcaa cttacatagc 13980 

tatcattttt tgtgtgtggc aagaacattg aaagtctagt ctcagcaatt ttcaagtata 14040 

taatacgtta ttattattac tatcgttaac tgtattcacc atgctgtaca atagatctcc 14100 

agaatgtatt cattctgtct aactgaaact taggatcctt tgaccactat ctccccattt 14160 

atcaccccct cccacttccc cactaccagc ccctcctggg aactgccctc tactctctgc 14220 

ttctgtgagt ttgactgctt taaattccac acaaaaagtg agatcataga gtatttgtct 14280 

ttctgcgcct ggcttatttt acgataaatt gattgcttct cttcccattg tttatagata 14340 

cccattctcg taatgatggc actgtgtctg tcatagaagg cctaattctc cctgttaaca 14400 

ttgaacataa aaacctcata tgggccaggc acagtggctc atacctgaaa tcccagcact 14460 

ttgggaggct gaggcggatg aatcacctga ggttgggagt tcgagaccag cctggtcaac 14520 

attgcgaaat tctgtctcta ctaaaaagat acaaaaaaat tggctggctg gtcatggtgg 14580 

ctcatgcctg taatcccagc actttgggag gccgagccgg gcgaatcacg aggtcaggag 14640 

attgagacca tcctagctaa catggtgaaa ccctgtctct accaaaaata caaaaaaatt 14700 

agccaggcgt ggtggtgggg cgcggtcgtg ggtgcctgta gtcccagcta ctcagaggct 14760 

aaggcaggag aatggcgtga acccgggagg tggaggttgc agtgagccaa gatcgcacca 14820 

ctgcactcca gcctgcgtga cagagcgaga ctccatctca aaaaaaaaaa aaaaaaaaaa 14880 

aggctgtgtg tggtgactcg tgtctattgt cccagctact tgggaggctg agacatgaaa 14940 

atcacatgaa cctgggaggt ggaggttgca gtgagctgac atcgcatcac tgcactccag 15000 

cctg 15004 
<210> 52 
<211> 4559 
<212> CNA 
<213> Homo sapien 
<220> 

<223> Dtih sequence enconpassing exon 7 
<400> 52 

ttttagttga catgtcatta tggattcttg ggcataaacg tttatatgaa ttttgaatat 60 

taggaaataa tcttggaagc tatattagtt ttctaaagct attataacaa attaccacag 120 

atggagtgcc tcaaaacaac agaattttac tctcttacag tttcagagac caaacatctg 180 

aaatcaagtg gtttgcagcg ttggctcctt ctggaggctg gaaggagcat ctgttccatg 240 

cctttctcct agcttcccat gactgccagc cacccttgga atttcttggc ttttatctta 300 

gtcagtctaa tctctgcctc tgtctttcca tctccttctc tgtgggtgtt ccttctcccc 360 

ttctgttttg taaggatagg cctgtcattg gatttaaggc ccaccttaat ccaagatgat 420 

ctcattttaa gatgtgggac ttaatcacat ttgcgaagac cctttttcca aataaggtcg 480 

cattcacaga tccctaccat tcaactcact atggaagcaa tgggatggga ggtaccattc 540 

aactcacaga ttcctaccat tcaacccact ataaagcagt ggcatatgaa aaggatggga 600 

ggttgctatg ggaatggtct gccccagatg caggtaaaat ggggaagcat tttatataga 660 

atggaaaaac aacagtaaag ctggctaaaa cttgatctgc tttttacagt catgtgctga 720 

caattctaaa cattatcaat gataaaatac ttctccctca tgatattaac tgtgcaattt 780 

ttgctgcccc tgctacatac atgctttatt gcttggaaac aggattactt ggtgaaataa 840 

tgtgaatctt tcaattctca tgatacatgt tttcaactta atatccaggg tgcaggaata 900 

tatattccca ccaactttat acaagataat acaaatataa tagtatgaat aaatatgcta 960 

atttgataga aaatatttta atatttacta ctagtgtagt tggacgtgtt aaatattgca 1020 

atggttattt gtaatctacc tttataaaat ttctaatgtc tgccactgtt tctgttagac 1080 

tcttaagaag atacgtgttt gtctcataga tttgtaagaa ctctttatat actaaaaata 1140 

tggggcttct atatgattgc atatattcgt cacttttgtt ccactgttct agagagcaga 1200 
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agattttaat ttctataaag atttttatgt ttcccWtat tgcatgccaa tgtttgtttt 1260 

ttaciaaactt gtacgtctta ggctgcgatg tgaaaagtgt cttttttttg atttggaaac 1320 

agagtcttgc tctgatgcct gtagtggtat gatcatagct cattgcagcc cttgaactcc 1380 

tgggctcaag taatcctcct atctcagcct ccgaagtagc tgggactaca gccacacacc 1440 

accacacctg gctcactttc gggtggttgt tgttgagatg gggtcttgct gtgttgctca 1500 

ggctggtttt gaatactggg ctcaagtgat tatcctgcct tagcctccca aagtgctgga 1560 

atttcagagg tgagccactg tgcccagcct cctgtttttt ttacatgtat atatatatat 1620 

atatatatat atatatatat atatatatat atatacacac acacacacac atatatatat 1680 

acacacacac atatatatat acacacacat atatatacac atatatacac atatatatac 1740 

acacatatat atacacacac acatatatat acacacacat atatatacac acacatatat 1800 

atatatacat atacacacac acacacacac acacacacac acacacacac atatattttt 1860 

tttttctttt agatggagtg tctctctgtt gcccaggctg gggtgcagtg acatgatctc 1920 

tgctcactac aacctctgcc tcctgggttc aagtgattct cctgccttag cctcccaagt 1980 

agctgggatt acaggcgtgt gccaccacgc tgggctaact tttgtatttt tagtagagat 2040 

gggggtttca ccatgttggt caggctggtc ttgaacttct gacctcgtga tttgcctgcc 2100 

tcagcctccc aaagtgctga gattacaggc gtgagccgcc atgcctgccc ttatttttat 2160 

atatcttcca taacctttct atctctttct ttctctttct ctgtttctgt gtgtgtctgt 2220 

gtctgataat aaaatacatt aaattttatt ttataacata aatgttttat cuiggtcggta 2280 

gatttgttta tgattttaat ggctgcatag tattctatca tatggatata ttctaaatgt 2340 

aataattcct caatggtata tttagtttac tttatatttg ctgctttagt ttgtcacaaa 2400 

cactgtggat agccttgtat atatttattt tgttgtatag ttcttattgg tacagtatga 2460 

tttttcacaa gtatacaatt actttctgag aatttagctt tttgattcac gtgaaagcct 2520 

tacttgtatt gttaacsiatt ttgttacgtt tgtgaggcct aggcagaagg attgcttgag 2580 

accaggaact caagaccacc ttgaccacct tgggcaacat agggggagcc catctctaca 2640 

aaaataaaaa ataagaaata aaatagccat gcatggtggt gtgcccctgt agtcccagtt 2700 

actcgggagg ctaaggtgga aggatcgtct gagcctggga gtttgagact gcagtgagtt 2760 

atgaccacac cactgcactc cagcctggat gacagagaaa gate teg tea aaaaaaaaaa 2820 

aaaaattaaa tatatacaaa ataagggaga atactatgac aggcttccac ttacctatct 2880 

ttaggcttct ttataaaatg ttcatttaat ctgatctcta taagaaattc tgtatcctta 2940 

aacaacatgt gcccttctct ttacacagag cgattacccc cttgctatgt tttattaaca 3000 

gctagcaagt gctgggtcct gtgtcagttg ttttgttttt tctttttctt tgtttttctt 3060 

ttcttttctt ttcttttttt ttttcgtttt tttttttttt ggagatagag tcttgctctg 3120 

ttggtcaggc tggagtgcag tggcatgatc ttagctcact gcaacctctg cctcctgggt 3180 

tcaagtgatt ctccggcctc agcctcctga gtaactggga ttacagacat ataccaccac 3240 

acctggctaa ttttttttgt gttttgacag agatggggtt tcaccatgtt ggtcaggctg 3300 

gtctcaaact cctgacctaa aatggtccac ctgccttggc ctcccaaagt gctgggatta 3360 

caggcgtgag cccccgcgcc tggccctgtg tcagcctttt taacagcata atttttcctt 3420 

tattagtcat aactacttta agggctattt tgatttacag acaaagtgag acttggatga 3480 

attaagaaag aattaatcag gtggttgaac ctgagttcca actagggtga caactcttcg 3540 

cagattgcct gccgctgtcc tggttttcac actgaaagtt ccgtatccta ataagcctct 3600 

catcctcagc aacggaggac agttgcccac tcaaagcctt gtgacattct agggtatcct 3660 

atttctacat ggtgggaatc tagaagacgg agaeiagaatt tctcattccc gaaggagcat 3720 

ggattatcct tggttgtagt gtttatgttc cacatcacgt ggattcccga aggagcatgg 3780 

attatccttg gttgtagtgg ttatgtccac atcacatggg atattttatt tttcaggcct 3840 

cttcatgtgc ttgtgtgcaa cgcagcaact tttgctctac cctggagtct caccaaagat 3900 

ggcctggaga ccacctttca agtgaatcat ctggggcact tctaccttgt ccagctcctc 3960 

caggatgttt tgtgccgctc agctcctgcc cgtgtcattg tggtctcctc agagtcccat 4020 

cggtgggttt gaattgcata tttgttcact tatccccttt ctcataccag ctaatattcc 4080 

cccaaggctc tcattctgewi aataattttc attagtcctg cttgagacat gtgggtggac 4140 

tcagcttggc tcacttaatt tttccaggtc ttttttgttc gcctgcgatt gtgggggact 4200 

gtttagaagg actttctaga gcaaggaaga ttgcctttac gactatactt caagctcctc 4260 

attgattttc gcttacagat ggaataataa cttcatgaaa aactcaatgg catgaaccta 4320 

ttattggatt tgtaattcaa caacttcaac atcttaccaa gaagaatgtg cagttattct 4380 

agcaggagaa acaatgcaat tagagcctgc gagatgaaat caaattgttt tataatgaga 4440 

aattagggaa ttcgaggcag acattagctg tgtaattgtg gaaagggaag aactgtagtt 4500 

agagcatatt agaaatctgg ccgtgcctct tttggttaaa atttcaatta aaacatcag 4559 
<210> 53 
<211> 26040 
<212> DNA 
<213> Homo sapien 
<220> 

<223> DNA sequence containing the FRA16D fragile site 
<400> 53 

gcttttgaaa ttaagatatt gtccattgca tctttccatg atcgaggcgt tgtcgtgcat 60 
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tcccctcctt acagcgtttt aggatgcaga ttgagaggag ataacgttag cccgaagcgc 120 

ctttgcagcc tcttttgcgc ggctgaaaca cagcacaacc cacccccccg ccccaccccc 180 

cagcctgtag gtttagcaga atcccagcct cacatcctcc ccgaacttgg cagtaaaagc 240 

cctgttcttc cattcattcc gatgatttat attctctctg ggcgtcttat attaaacagg 300 

ggaattccga catgttccat aacacattta ctgtaacttg ataccatgaa ctacacttgc 360 

tgttatttat datttctttt tattttctct cattgcagca taaagccaag gtagaaacaa 420 

tgaccctgga cctcgctctg ctccgtagcg tgcagcattt tgctgaagca ttcaaggcca 480 

agaatgtgtg agtgttccag tggagggtta tagatcataa tttcttgcta ttgtaatatc 540 

tttatcagat gaacacaatc gggagaatgc aaggctgttg tgttgtcttg gcgtccaaac 600 

aggaggctca tttatattgg ccctgttaag gtgaaccgta ttttcttgac tcacagtcac 660 

cttcattatg agatgtgtca tcaatctaat aacagcttcc cacaCaccaa aagagaagac 720 

actattaaag cactagtaaa agtggctaat aaaagcttgg caatagtaag atgcatcctg 780 

attataagat tttttgtagt gcatttcaga atggagtaag agtatattta aattgcatCc 840 

aggaacaagt aaactcaatt atccaatatg gcagggaggt tgacaatcca agcacccaaa 900 

aciacctctag tttctaaagc cttcgatgat ttgatgtggt acatggatgt ggttccaaaa 960 

aacatggact cacattcctt ttatttattt tttttcatcc ttttcagtct ttcaaaattc 1020 

cagttggaga aagccttagt tagggcctag catattttga tcctatcata tgctagcatc 1080 

cctttctaac agagaaggtt gtaggagaaa gggagagaag cggaaggggg tggggagaca 1140 

gagagacaga caggaggcct caaaccctga aacactgagc taaggaaagt gatcatggca 1200 

agctacacta attacaatac tttgtttcca agtgtttatt tttactcata tttagggcag 1260 

gcaatcctgg tttctcgttg aacatagagg tttgaatttc attaataaat aacttcattt 1320 

attttttttc agtgacttga ttcaaacatg aggattaagt taataatagc acaggttgtg 1380 

cgaaggataa gataattaca caagaggcac cagaaccact gaatgtggag agctctcata 1440 

aatgacaagc tgcctttggg ttaggctctg ttgggaacat tagttctgca gtgttgcaag 1500 

cagatgaagg atgtgaggga agggatctta aaccagatat tcaaatggcc ctgtggggag 1560 

ctgacaccac actgctgtct agtgtccaat tctccttgca tggctgtgtc acccaggttg 1620 

gaacgtagtg cacaatctcg cctteiatgca acgtcccact gcgggcccaa gtgattttcc 1680 

tgtcccagcc tcctgagtag ctgggactac agatgcctgc caccatggcc tgctaatttt 1740 

tgtattttta gtagacatag ggtttcacca tattggtcag actggtctca aactcctggc 1800 

ctcaggtgat ccacccgcct cagcctctga aaatgctggg attacacaca tgaaccactg 1860 

cgcccagccg ctctacttta ttagatttaa aaagtttgct ctcagctggg tgcagtggct 1920 

tatgcctgta atcccagcag tttgggaggc caaggcgggg agggtcatga ggtcaagaga 1980 

tcaagaccat cctggccaac attgtgaaac cccatctcta ctaaagatac aaaaaattag 2040 

ctgggtgtag tggtgcacac ctctagtcct agctacttgg gaggctgagg caggaaaatc 2100 

gcttgaacct gggaggcaga ggttgcagtg agccaagatg gcaccactgc actccagcct 2160 

ggcgacagat tgagactccg tctcaaaaaa aaaaaaaaaa aaaaaaaaag tttgtactca 2220 

gcccggcgtg gtggctcacg cctgtaatcc cagcactttg tgaggccgag gtgggcggat 2280 

cacctgaagt caggagttcg agaccagcct gggcaacaag gtaaaacccc gtctctacta 2340 

aaaatgcaaa attaactggg cggggaggca catgcctgaa atcccaacta cttgggaggc 2400 

tgaggcagga gaatggcttg aacccgggag gtggaggttg cagtgagctg agactgaacc 2460 

agtgcactcc agcccgggtg acaagagcaa aactctgtct caaaaaaaaa aaaaaaciaat 2520 

tgactcaggc tcttgctgga gtatgtcagt gtcccaagtc attgaggtct acattagagt 2580 

cagtcttagt gagagttgca acatcgccta agcgcagacc ccaagctggc ttgtaggaac 2640 

agtgagateia catcccccag ccagccagat ctgagagagc cctcctgtgt gtgtcctttg 2700 

cagcgcttgg ctgttagtag cttccaccct ttgccagaac tatcaagagg cacctcaaca 2760 

ggctgcaaca ctagcattca agacatttgt tcgctgattt gcttcgtcat tcatttatcc 2820 

accggttcct tcacttcaca cacatttcag gagcacctct aaagccccag acattcttct 2880 

gggctctgag aagagagtgg tggccctgga agccacagta accgcacaca agctgtcacc 2940 

tcagttgtca gctattatgg atggctgact cctcaaactc atctgtctgt ggttgtttgg 3000 

gagggaatga taatgacaag aacagcagcg gagtcagcag ctcatttact gagggcttat 3060 

gggccatacg tggaaaatat tttgcatgta cgttttgatt caattctcaa agcagtcctt 3120 

gggagCagat gctaktaata tctctactat acagataagg ciaagtgaggc tagagaggat 3180 

atataattta tccaaggcat gtggaatgtg ggtagagggg ccgatttagc ctcttcacgg 3240 

ctccaacttg accaccgtac tatactgtgt ttcattttag tttatttttt atttattttt 3300 

tgagacaggg tctcgctgtg ttgcccaggc tgaagtgcag tggtgcaatc tctgttcatt 3360 

gcaacctctg gccttgggtt caagcgattc tcatgcctca tcctccagag tagctgggat 3420 

tacaggtgca ggttaccaca cacagctaat ttttgtattt tctgtagaga tgaagtttca 3480 

ccatgttggc caggctagtc aactcctgac ctcaagtgat ctctccacct tggcctccca 3540 

aactgctgga attacaggcg tgagccaccc tgcccgaccc tttcatttta aatgtgatag 3600 

aaacagactc agagaatgtg agtgactttt cgaagggtca ctagcaagtt ggtgacgggc 3660 

atggggtttc actccagtct gcctcttgtt cttttcctca tagcatgtga gatctcacaa 3720 

ttcactgccg ctgaataact ttgtccagtt cctctgtcag aatgtgttgg aagtgagcag 3780 

ctttgctgaa gagagccaac tgttacataa tagggatcat cttccacgtc ttaactggac 3840 

cattagtttt taaccattcia gtagttggga aatcatcaag ttcaagtgct ttttgtaatc 3900 
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accaagcCca aaactttgga ggaaactctc acccaccttg ccctaatccc agattttagc 3960 

acaggacttt ggaattttag aaagtacttc taggttgaga aacaagggtg attttccata 4020 

tagagacctc tgtagcacaa ccctctttgt atgtgtccca tgagagtaca ggattttaca 4080 

tggcttagag aaatactgtg gtggtgaaaa tatgcataaa aaagcgagag attttacttt 4140 

tgaaatgctg ccaaagtgaa atctcttttg gataggcaga tgacagttta gtacttgcag 4200 

gtctttcgat tctgacaact gtcagaagtg aatttaagtg ctggtgacta ggctgccaga 4260 

gaaacacatg taactctatt cttatcaccc cccatcttca gacattaaaa ttttcttgtg 4320 

tgtaaaacat aatgacaaat gaaagaaagg ctgccagttc agaaccaatc tcttttcacg 4380 

ctgttgcatt gaggagagga aaaaatccct taaatttgaa ttccatgtgg atgctcttat 4440 

ttaatatgcc gtttcaaagc ctgtacgcga tcaagccaga gtattatcag gctctgagtt 4500 

cctgtgtgta ttcccttggt ccatgctcca tctcaagtgt ttctccacac tgggataggc 4560 

agcatattgt gtagtcttag agcagaaaag gtageiaacag attcttagta caaacactgg 4620 

tgagaaaagc agcatattgt actgtggaac cacccgccgt ctgccttctg tgggagatgg 4680 

ggatcagcct cttcctggat ctcttctttt tattcgggtg ctgggaggtg aggctgcaaa 4740 

ggaggctgtt gtagagggag ccctcctaac catgattatg taaacaatag ctgctaggca 4800 

acttgttgtt tgttttatgc aaggcagtgt gctaagcact tacatacatg tattctgtct 4860 

gtagtttgtt ataagagtcc tgcacactta accatgatta tccctacttt gcaaattaca 4920 

aggttgcagt gcagagagcc taagacactt tctcaacttg agcaagtagg agatccagaa 4980 

tttgaaccca aatctctctg aatccaaaac gactttttgt tactttgcac acttcagtca 5040 

taggaatttc agtgtggatt tctgtattgg ttatgtattg ccgagtaaga aatgactctg 5100 

gccaggcacg gtggctcaca catgtaatcc cagcacctgg ggaggccgca gtgggtggat 5160 

tgcttgaggc caggagtttg agaacagcct ggccaacatg gcaaaacccc gtctgtacta 5220 

aaaatacaaa aattagccgg gcatggtgat gcatacctga agtcccagct actcaggagg 5280 

ctgaggcatg agaattgctt gagctctggg gtgggcagag gttgcagtga gccgagatca 5340 

tgcctggggg acagtgagac tctgtctcaa acaaattact ccaaaattta gtggctaaaa 5400 

aacaacataa aacaagaaac atttctgttg tgcctcctct tttcctccct ccctctttct 5460 

catagttaat ttgtttgaat caggattcaa ataaggtcga tgcattttta atctataggt 5520 

tttgtctcct tttttttttt ttccttaaaa aaatttcttt taatgcaaca gatttaaggg 5580 

accagggtgt tctctagagc aggggtccat gacctctagg ccacaaacca atagcggagc 5640 

cgtggcctgt tataaattgg gtggcatggc agggagtgag tggtgggcaa gcataggaag 5700 

ctgcgtctgt gcttagagcc actctctgtc actctcatta gtgcctcagg tcctccccct 5760 

gtcagatcag caacagcatt agtctcatag gaccgtgatc cctattgtga gccacacatg 5820 

caagggatct aggctgcatg ctccttacga aaatctaatg cctgatgatc tgtcactgcc 5880 

tcccatcacc ttcaagatgg gattgtctag ttctaggaaa acaagctcag ggctcccacc 5940 

gattgtacat tatggtgagt tggataacta tttcattata tattacaacg taataataat 6000 

agaaataaag tgcacaataa atgtcatgtg cttgaatcgt cccaaagcca cccccactcc 6060 

tttgagtcta tggaaaaatt gtcttccaga aaaccacttg ctggtgccaa aaaaatttgg 6120 

aaacctctgc tctagagatt tctgcagtct gtactttggt gattgcatct ctgtggtgtc 6180 

ttttaacata ctcttctgtc tcctgagttt cctgtgaaag ggtagataac tcaagagact 6240 

caatcaactt cgtgtttaat gttttcgtaa gaatacttca gaggcagtgg tgtaatcgtc 6300 

caagagtaca cacaggcctg atcgccaaag ccagttttct tcaggcccag cacgtcctga 6360 

cctcctcact cctctgccca ctctctgttt actcactccc attttcattc attgattctg 6420 

ttattctcca ctaagttttc tgggtggtca gcccaggaag cctcagcctg cttccagata 6480 

gttctcctca gagttagcgg tagcagcggt cgtcggccag tgtgtctcgg gtcaaattat 6540 

tttgggcatt actggaaagt tccaaaccaa agcctgtggt aagttccacc caaactggct 6600 

tgtgagggat atttttgttt aaactgcatt cctggagagg agccagactt cctactgcct 6660 

ggcagcagtg gtgtccatag gagtatgaag agctgggacc ctttctaatc actcagacca 6720 

aatattgggg ttttctgaaa tggactgaga aataacagta tgtttttatg aggctttgca 6780 

cattttcctt cctagcagta gctgcttagt ctactgaaaa gtgcatattt tgaacagggc 6840 

ctagaagagt taacagctcc tagagagagg tgctctgtaa tactttttct tcttcaaaaa 6900 

atggtttatg gctgggcgca atggttcatg cctataatcc cagcgctttt gggaggccaa 6960 

ggtgagagga ttgcttaagc ccagaaacgt gagaccagcc tgggcaacgt agtgagacgc 702 0 

tgtctctgta ttacaaaatt ttttaaaaaa cggtttgtag gtccttaagt ccctgataaa 7080 

atagagaact gaattgcaat cctggaactt aaaaagttgg tgacgacacc tgagatattt 7140 

attacttaga ttgcagttac tgggtcagct tgtataatac tgaccaaggg tttttgattc 7200 

ttcctggaat tgataggaaa ttcatattaa aataattacc caagtccaaa catttttaga 7260 

actgcatttt tgatcatgga tttttatgtc tcttctgaac tttctgtcac cggtataatt 7320 

taaagaaatt atacttaagc tttgtctcac ttagaagata atatagaaca gtggtgtttt 7380 

tttaattaaa aaaaaagtta aaataacggt tttgtatcct tgctttactt cttaaacata 7440 

tgggaggaaa aaaaatcttt aacaagttta tttattttca ttttctgcta aattactttc 7500 

agaacttgaa tccactaatc ccagatataa tattcttgga ttcatattcc aaattttgct 7560 

gtctcaaatc catctaggga agtgggtggg ctataaatta taaataaatt ccaaattttg 7620 

tgggatgaat taccctgaag accaacgtgt aaattacata ttaatctttc tttttctccc 7680 

tagctcggtt ttaagaataa tgttttagcc aacatatctg cattactctt ggctcaatat 7740 
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gagaaatcca tttttggttt gcctaacaga agatcacgtt gctttgcttc tctacacagt 7800 

atgaaaaccc aaagaaaaga aaaacagagg cagttttttg ctctaatgaa tgctctaaat 7860 

ctagctctta attatgattt tttaaggaaa attttgaaaa gtctacaagt taaatttttt 7920 

tttctatccc atacattttc catcctaagg cattgaaaaa gcacactgtg aaatacttag 7980 

tgtatctaga aacatcaggg aagaatgctt ccctcctaag caaaattttg ccttctgaaa 8040 

ctttttcagc attcagtctt tttatataat acttagaaaa atatttctga aatagatcat 8100 

acactctctt cccaaaaaca tcaaagtatg accgtaaagg gcagaggtag gtaaacttct 8160 

tgtaaggggc cagagagtga atatttgaga attttcagac tattaagatc tctgttgcaa 8220 

ctgcttgctt ttgccgtggt agcctgaaag cagccgtaga tagtatggaa atggatgatc 8280 

atggcagtgt tgcaataaaa ctttacaaaa cagacaatgg gccagattgg ccaggggcca 8340 

tagtatgcta cccctgggca acaacctgta tgccctggag tagtgtaaag aacgtgggtg 8400 

ttgggggtca actgacgctt ccagctctac cacttactgg ctgtgtggct ttgggcaaac 8460 

tactgaaaat ctctcagcgt cactttccaa gtgtgtgtaa tgtgtatttt cacagtgctt 8520 

tgcaggttgC tgattattga eiaatagccat aatgcatgaa attaccagac acatctcact 8580 

ttatggagcc tggggctatt ggtaatatgc atttctttct catcttgatc gtaaaatgat 8640 

cttagaaagg tttctgagaa tatatagagt ttaagacagc aataagacaa ctaattaatt 8700 

aaacaggaaa aggggatgtt gtgctcagag aggaagtgtg ggtctcataa gggctttcac 8760 

aatcgtttga gaggacacgt gtgatgtctc atgcctgtta tcccagcact ttgggaggcc 8820 

ciaggcaggca ggttgcttga gttccggagt ttgagaccag cctcggtaat ttggcaaaac 8880 

cttgcctcta caaaaattac agaaattagt tgggtgtggt ggtgcacacc tgtagtccca 8940 

gctgcttggg aggctgacga gtaaggatca cttgagccag catggtggag gctgccatga 9000 

tcatgctact gcactccagc ctgggcaaca gagccagaac ctgtcttgta aagaaaagga 9060 

aaaagagaga gaagggcaga aagaaagaag ggaaggaaga aaggaaaatt gggcccagga 9120 

atgatcttta caatgcctga caaccaagag aagaagggaa atgagcttca cattgcctgc 9180 

aagctctaag gtgacaagag ccaagagaaa ttattgttac tgtagtgatg ttccactgag 9240 

gatcataaag tactttatta ctctactgag tatggttatt ggatatgtgt tcttcttttt 9300 

ctttttcttt atcttttttg ctattctttt gttattcttg atttatgctg atggaaagcc 9360 

atggacccaa ggatgcttca cagttttctt taggagtaaa tgcttagatt ccatgttctt 9420 

tgacatgagc tatgtctgtt cctctcgagt ggaagcatcc tcttcagatg agttgccaga 9480 

aaagcagcca gctctggata agtgaggtac agcagaacac actgcaaata ctaggaatcc 9540 

ttaagtacag tggaacccca aagcactcta cctgctttct ttctcacctc cttaaaaact 9600 

ttttttgccc tcacctcatc atttattcag cagtcacaac agtgccaaga acttggctag 9660 

agattggaaa taaagcttat gccttctctc atatctcctg gaccttattt ctttcttaca 9720 

agaattgtga tgcttaacca gtttttttga taaccttctt ataaatgcca acccttccaa 9780 

aaaacctgcc cccctggtgg agagaagaat tattacatca attaggggtc acttagcatg 9840 

acatttgtcg gaciaaaaaaa agttagtgag cctttttgcc atattsiaaag tcatcactgc 9900 

caagacataa atgaaaatgt gttcgaatta accacaccaa tgttcacaaa ataaacattt 9960 

ttgatttccc aacagaatcc taggtttaac tatcactatc atctttcatg aaatcaaagt 10020 

catatatgta aattgaacac aactttccct tccatagaga gtaaaaacca cgctttggag 10080 

ggtagataca attaccccag ggttgtcttt tcccactcct cacaatccca ccagtgcaca 10140 

tgcaaggtga tgtccttcct ttagctatag caaataatgt taattattgt tggtgttaaa 10200 

taatgattat gcaaagcact agactagaca ttcgtcggca aagtttttct gcaaagaatc 10260 

agatagtaaa tatttttgct cttatcagcc agacagtctc tgcggcaacc attcaagcat 10320 

tgttgaatac attgttgagt gttacatgag actattgtaa tataaaagta gcctgggaca 10380 

acacataaat aactgggtgt ggctgtgtcc taataaaact ttatttacaa agaacaggaa 10440 

gfcggcttgga tttggtatct ggcctggcag ctgtggttta ctactcctta gacggtggcc 10500 

cagagaccct ttaaatgaaa ttcattttac tagcaccctt tttcatcatg agaaatatat 10560 

tctgtttttc ttagaaaatg ggttatgtta ggtctggtca aggtciaataa gtgttgagag 10620 

tcgatgttgt gtgcatagtt aatttcaatt ctttgaagaa gctcccccat gatatttcac 10680 

ggctgagaga agaggaaaga gcttaagtgg aacagtgtgc tttgctgagc tttggaaata 10740 

ttaccatata gggaagcagg tcaataagac aactaagtgc tgtttcaata acgaagatac 10800 

tgaagcgcta attggagtat ggaaccatat aatgatgata ataattgcta atatttatca 10860 

gctatttatc atgtgtcatg tacagctaag cacttacata ccatcccatt ttatccttat 10920 

aatgactcta agagtgggtt aaaaaatggc agaagaatgg cagtgtttaa tggttcagcc 10980 

tggtgcaacg attaaccaca ttttacagac taagaaatta agaacctcaa ccaagttcat 11040 

gctagctggt acgcagtaag gctagaactt cgtccaaaat ctcttcttct gttgagctca 11100 

gcttgcatag tagcttggag ciatcagaaag atctgtctca tttgggaaat ataccatgta 11160 

aaaaacattg tttctaaagg agatttgtcc catgagtaaa atagatgatg gacagccact 11220 

gcctagtggg acaattagaa aggtcagttc aaggttggag gagatgcttc tttcagccaa 11280 

ttttcctttt tctcaggatc acctcaggtg atccgcccac ctcaacctcc caaactgctg 11340 

ggattatagg cgtgagccac cgctcctggc ctttcagcca attttctatc accaaaggga 11400 

aatcgttttg ctggaatatg tggtaaagga ggttaaagtc aaaagaaatt ctcgtcccgc 11460 

tcagttaagg tactcagact attttccaac caatcaaaag gggtgctgct tcatggagtt 11520 

cgtttaagct aaagcggcag ctgttgactg tcatttgcat catctttaaa catttactgt 11580 
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gaatgtcact gtccatttcc actttctttc tacttg\ctt caaattgatg cttatcaagt 11640 

agacagaaga agaccaaggg gtcgttttgc tatttatacc tccaaattga tggcgtgatc 11700 

actctcaagt gcaaacccag ccctgacact gtcctgtttg gcaatgtcct gctatctgac 11760 

ctgcaaatag ctacacttcc tgctgtggcc cacccaggcc tctgggatct gatcccttct 11820 

cccatgtcac ctgtggccat gtcttcccca acccaggctt ctctgctccg ctgcttatgt 11880 

tctggatcct gactctgagc ctttgcttgt gtggctcctc tcccacttgt ccatcatcat 11940 

gtcagttatg taagcataga ttccaaagtg acaatgaggg tagttaggat ggcaagagga 12000 

gaaaaaccaa gctggattca tggattgctt gtgtcacgtc ctctacagag cctcccttct 12060 

acctgctctt acccgggcac cacagttgat gagttatttt ttggaccatc agcaacacac 12120 

ccaatcattg tacatggaat acttcgggga tgcttttttc tatattgatt ttacttcatt 12180 

aaactgagct ccagaggagc agaaactttg gctaatacat cttggtctta gcttgtaata 12240 

tctgtgctac aacttattaa gatggtgacg ctggcaaatt ccttaatctt tctaattctt 12300 

aatttcatca gcaaagatgg gaaaggatac taacaccact ctgggttgtt gagaggattc 12360 

aataactgaa tatttataaa gcgtagtacc tgatacttaa taaaaaagtg aattttaacc 12420 

tgtgtcatca ttgtcatcgt ctttatcatc ctttgcaatt attacattta ctgccttcta 12480 

gtacaaggaa ggggatgggt ggctggctgg ctaggtggat agaaggatgg aagaggtaag 12540 

tacaaggaag gggatgagtg gctggctggc tagatggata gaaagatgga acaggtaagt 12600 

ataaggaagg ggataggtgg ctggctggct agatggatgg aaggggtgga aggggatgca 12660 

agaggtaagt acaaggaagg ggatgggtgg ctggctggct agatggatag aaggttggaa 12720 

gacaggaagg atggaacgga ageiagaggga agaacaaaaa tggaaacaca tagtactgtt 12780 

aggtgaactg aacttctaag gtgccgattc tcagtgatag aatcttgagt tgatacctcc 12840 

ttgggtggca tggagcctat acctttgtag atcttgggaa acaacttcta aaagaatcat 12900 

agttgtatgt aatcgtaagt acaacaataa ttattttagg cacataaccc aaaggttttc 12960 

ttacaaggaa tctatgaacc ttaagagtga gggcttctgt taagagtaag ggcttccctg 13020 

gagtggacat ggatcatggg actgagccag cttggcattg ttgggttgaa cagggagcga 13080 

cacctctcag cccagtctat caagcctgct ctttgacctg cagtgagacc acccacgcag 13140 

acatcaatgc agcaaatccc cggcgtcagg gtttcaacat ttggttactc tcagagaact 13200 

ctcgatttat ataagacttg gaaaaagggt ttgagtttct gtggtttaca attatatttc 13260 

ccaacttggc ccatgaatcc agcttggttt ttctcctctt agcatcctaa ccaccctcat 13320 

tgtcactttg gaatctacac ttacataatt gacatgttga gaagctgtgc taaaaccaca 13380 

ctgaaatcac attttaataa catgggaacc atcttttccc agtaaattgt tgagaagcta 13440 

attcttgtca gcctaaaaac ttgaatatac atttgaataa atcagcggtg ctacaccgtg 13500 

gcagcctgct gaaaatcccc aaggaagatt atatttttag ttgagctact tgtcactgca 13560 

ctgtgttttt aatattgtga gtcctttctt gtcttcattt ttgaagaatc tattgcatac 13620 

cttgtcattc agaaaaacat aaacgggacc tctcaattag cggtaaagtt cgtcagttta 13680 

acttttaagc ttaaactcct gttatagttc ggtcacttac tcgtcatcaa aaagatattt 13740 

gage tga tat tatgcaataa tttataacca aaaacaggag gaaaaggtct ctgtttgtct 13800 

cagaagtaca agttatgcat aaggtgacaa attacacagc tttgggaaat gggtcttaat 13860 

ggaatgcacc aggctaatag aaaagcagtg cctcgatttc ccacctcaga tctgaaaact 13920 

cctgagagac tgacagggcg gttccccagc ttggctgtgt gaggaatcag ctgggaatcg 13980 

aacactgcag tctggctgtc acccccagga ttctaatgta attgttctgc attgtggatc 14040 

gatggatgga tggatgaatg gctggacaga aggaagactt ggaaaaaatc tttgagtttc 14100 

tgtggtttac aattttattt cccaacttgg cccatgaatc cagcttggtt tttctcctct 14160 

taccatccta aggagggagg gagggaggaa aagggggaag gaagcaggta gaciagggaac 14220 

agggaaggga ggactggagg aaggaaagca gggatcattc tggagtgtac actgggcatt 14280 

tgattttaaa gcaaatccaa gagatcatta aatttgtatc ciatgaataca ctgttacccc 14340 

tcaatcaaca ctacaattca tatatcagaa tagcactttg ttacttgttt ctgagatagg 14400 

gctggctctg tcacccagac tggtgtgcag tggcacgatc acagctcact atggcctcag 14460 

ccaccagggt tcaagtgatc cttccactca gcctcctgag tagttgggac tacatgcctg 14520 

caeca tcatg cctggctaat ttttattttt gtagagacga ggtctccttg tgttgtccag 14580 

gctggtctca aacttctgga atcaagtgat cctcctgccc ctgcctccca aaatactggg 14640 

attacaggca tgagccatgt ccagctgcct tgtttttttc tttgaagaga atgatagacc 14700 

ttccatagga aaaatgttaa atatgtgtgt gcagaaataa agatttcaaa tatttttgca 14760 

agatattttt tctaatacca cttttttcta cattttccat aatttagtga agatagtaaa 14820 

ttaacaaagt ggaaaagact gaatatttta agaaaagcca agtttaaaaa ttttgaacct 14880 

aatatttctt aaagtagcta aaattcagat attgagaata aattcaactt gacatggcaa 14940 

aattctaata ggctgaaata atgttttggt ctagaactat gtagctttgt gtagcccatc 15000 

aattgtctaa aaaaagagta acctattttg atgaaactcg ctgtatcttg taacctgtat 15060 

cctgtcttgg tattgtggga gtatatatga tttaggggaa agagtctgga gagaccttaa 15120 

gtctgcttta gggaaagggt gaggaacccc actgaaggct acttaacgca ttttgagaat 15180 

gtcagtaaag atttctcaga gcccagggat ttttttttta attgagacat aatttacata 15240 

caatacaatt ccttttttct tttttttgag acggagtctt gctctgtcgc ccaggctgga 15300 

gtgtgatggt gtgatcttgg ctcactgcaa cctctgcctc ctgggttcaa gtgactctcc 15360 

tgcctcagcc tactgagtag ctgggattac aggtgtgcgc caccacgctt ggctaatttt 15420 
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tgtattttta gtagacacgg ggtttcacca tgttgg1:taa gctggtctca aactcctgac 15480 

ctctagatcc gcctgcctct gtaatcccaa agtgctggga ttacaggcgt gagcccctat 15540 

gcccagccaa aat teat tat tttaagctgt acaattgagt gatttttatt acgttcacaa 15600 

agttacgtta ccattaccac ttttgaattg cagaacaatt tcatcactcc aaaaagaaat 15660 

ctcataacaa ttagcagtta ttttccatcg ctacttcctc cagtcccaag gcaatcagca 15720 

gtctgctttc tatctctacc tgtttgccta ttctggacat ttcatatgag tggagttaga 15780 

taatacatgg ccttttgtga ctggctttca cttagcataa tgttctagag gttcatttat 15840 

gttattgcaa aaatcagtac ttcatttctt tttatggctg aaaaaaattc cattatgtag 15900 

atgtgccata tttgtttatc tggttgtcag tcggatattt ctgttgttcc tactttttgg 15960 

ctactgtaaa agatgctgct atgaacatta ttgtgattat tataccttat ttgtaaacat 16020 

catgggtggg gggttgcagt aaacatgttg gaaagtaggg ttggaggtcc gtagaaattg 16080 

ggggcttcag cacttccccc aagctcaaca ccaaccccct ttctgagccc ctcttgaagg 16140 

agagttccct gggacgtgcc tggtattggt acaatcagtc aggaagcatt tttcctgggg 16200 

agaaacttac aagtccacga tcaaagccaa caagagacaa ggtgttacat gactcatttt 16260 

cggtttaaga agtgacaggc tgattctaag ttgggttcaa ttattttgtt aaagcgtttt 16320 

gcttatttga cttctcctga cctcggaaat aattctaacc aatcagtgct ggctcccatt 16380 

ggccctgggg tctggttgct ttacagctgg tgacaggggg accactccac taccacatgt 16440 

gaattaatcc tcaactccag agccaagtgc cattctccag caaggttgta tttcttcatt 16500 

agctattccc agggcccaga aagtcccaga ggatgtcaga gtacattaat ttttatcata 16560 

acatggaatc tttcaggtct gaatggcagc acacggctgt caggggcttc tgaactctat 16620 

tacagctcca tatatctcta ggcaaaacag aggaaagagt cgtcattggc aagggagatg 16680 

tacaaaatgc atgagatgtt ttattttttg agtgacttga ccacgtgctt aagcacattc 16740 

cccaaacaat ttttttctta ttgtttgtaa gttgtaagtt gtaaattcac ctctgccacc 16800 

acctattaaa gcccactccc tgcattaaaa ctgtataaag tgtatttaaa taaactctct 16860 

ttgcatgatg tgaatgaaat cgtcatctgg tacttaaaac tattctataa agttattaaa 16920 

aaattaatgt tcccttccca tgatttttct gcagaattta tgcatccatg atactgcaga 16980 

agttcataaa taatggcttg tattgctgct ttagtattgc tttatgccta cgaaatataa 17040 

tgttaatttg tagcaatgct aatgtgtttt caggaaggct ctttgtttat tgcctttatt 17100 

ttccccactt accaagtggg taaaatgctt tgagggttgc attttatgta ttcaggaggc 17160 

ccaggtatta ttttaataga agcactattg acaaatacca gtcatccccc ctgtgccagg 17220 

ccctggatga ggcactgctt cgcatggggg ctccccagat tgtcccacaa ggaaagcata 17280 

gtcaaagaca aagttttcag ttgtaagagt aaatgtgttc tgcctaggca ttgtcaagta 17340 

atttactgcc agctctagcc cttcactcaa gtttcctgga tacttttgac ttcttagcca 17400 

tggatgtgtt tgaaggctgc atggaccttc acttacttgc actgcaggtc agcctaattg 17460 

catgagctct gtggaccaca gagcagggtt ttccaaagtt caccaagaca aatattgtat 17520 

tatcttaaca tatattcatt ttttaaaact gaaaatcaga agagcaactc cacctagcag 17580 

aagtcttttg caaagggcga ggcgaggcta aaaagtatag aagagttcgt ttccagtgca 17640 

attttataaa cacagatggt ccttaaatta agcaaatggt acctaaatga ctgtgttgtg 17700 

gataatggta acagagggag ggactcgggg gttttttaaa aagtactgat tgtatgcagt 17760 

gttttaaaca gataactgtg atcttagtgt gatgaaagat gctgggagat ttcaccagtg 17820 

gtatcttatt atttttcggg gattttgtaa ttcaacaaaa ttctgttgta tgccaagcat 17880 

aaccctaggt gtgagagcac aaggtgactt cagataccac ctctatcctt caggggtttg 17940 

gggcccatta ttatgactta atccattttg ggcgtgagaa gctgagggtc acagaaagaa 18000 

ccaattccct ctttaaataa tgccacccca accctcctca tctgccaggt ctttcccttc 18060 

ttctatttgt atgaatciata gtcactttct cttgtggagt tcgctaaatt ctactttggc 18120 

ctatcaaatt tctttcatat cacaactaaa tttcttaagg acgggactat ggttcatttg 18180 

tcagacgsiac aaatgggaat ttgccciagag acacttgggt taatttacgt cttttccatc 18240 

caagggcact atgttgaagt gaggctagta ggtcatgagt gtggttgaag ttactttttc 18300 

ttactttccc gaccagcccc catccttact gcacttaaag ttgattgtcc attttattaa 18360 

atgtccccag gaagccagaa cacagggcag taaagtgctg aatgcaaagg gcaggagaaa 18420 

aatggaaaca accagaactg taacaccaag gaatgagacc tgcatgtcag atatcatgcc 18480 

cattgcacta agtgccattg gggcacaatt atcaaatgga tgcattttcc ctagaaaacc 18540 

atcttggaga gcatgtggat gtacttctat tttacatttc cccctattta caatcaatga 18600 

gattgagatt ttgttgctgg gactgctgat gatgggatgg gaaaatataa tcaaggtaat 18660 

ggacatgagg caaaaattta aggaaatgac aaaaacaaga gtatttccat tttcagttaa 16720 

gtgtatgtac tgatgttctg gaattcacta taagaagttg caaatggtgc atgaaatgaa 18780 

aaattcctgg tggtctccag gggacacagc cggtgctgtg ctccactctg ggtaactgtt 18840 

ttggattatt ttctctattc caactgaaat aaaaaaaaat taattaaatg tggctaggtt 18900 

atcttgacag cagaatccat tcccagttaa ttattatttt aatacttgat ggtgtctgtc 18960 

aaattgtcga catgtgacgg tcctttcaaa tttaaaggaa tagctgatgg tcactggcca 19020 

cccaagctga tactgatttt atatgttgat gtttctcatt ttatttgctc tttccttgaa 19080 

t at t tat tea ggacattctc taccagaeat attgagtaag ggcaaeagaa acaataeata 19140 

agtatcttat aaatgtggaa aacaatgtat atgtgttttt tatctetcaa tgattggtgg 19200 

gtaccatatc cccaciagtag aatgagcatt tgagaaaaca ggaaatatcc tcttttaggc 19260 
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accatctctg tcaaggctga tgctgggctt tttata^att ttctctaatt cttgtggctg 19320 

tcaaacaagg tgggcattat cattcccttt ataggggaca cagctgtggc tcagaggggt 19380 

ttattcactt tcctgagggc cacacacata atgagaggca gacacaggtg acgaagtgag 19440 

ttttccctgt cacgccatct tatctgtcac atacctctct gacatgctaa aattgcacta 19500 

aacaaaagaa ttctcttatg cacatatcat gcaaaagata ttctttaact ggggatcatg 19560 

tttctcattc catcaataga atgactaiaca ttttctgagg gtgtctcacg tgaaagtaaa 19620 

tcgctcatgt ttgttctttt taaaagatgc ccttcgtatt gtgtatcttg cagtcttgct 19680 

ttctcaaact taagccaact atatcgtcat ttttgcaaaa tcactgcgtc agtttactat 19740 

tatttaatgt ttattgctac caattttaag aaatccttta taggactatt tgtgaaattg 19800 

attttgtgag gatgatgata taatttccat tacattacag catataaata taaatatata 19860 

tatatatata tatatatata tatatatata tatatatata tatattttat tatttttttt 19920 

tgagacggag tcttgctctg tcaccaggct ggagtgcagt ggtgcaatct cggctcactg 19980 

taacctccgc cgcccgggtt caagcgattc ccctgcctta gcctcctgag tagctgggac 20040 

tacaggcatg tgctaccaca cccagctaat tttttgtatt ttagtagaga tggtttcacc 20100 

atgttggcga ggatggtctc agtctcctga cctcgtgatc cgcctgcctt ggccttccaa 20160 

agtgctggga tatacatttt tttttttttt ttgagagatg gagtgtcact ctgttgccga 20220 

ggctggagtg tagtggcgca atctcggctc actgcaacct cctcctccgg ggttcciagcg 20280 

attctcctgc ctcagcctcc ccagtagctg ggattacagt cgtgtgccac cacgcctggt "20340 

tagtttttgt atttttagta gagatgggtt tcactgtgtt ggctaaggtg gtctcaaact 20400 

catgacctca agtgatccgc ccgcttcagc ctcccaaagt gctgggatta caggcgtcag 20460 

ccactgtgcc cggccggata gaaataattt ttataaactc cttggatgct acctaaaatc 20520 

atcttgtttt gctagtggca catgctgcat tttgggcagc tgtggccttg gtggattgct 20580 

gaagtagatt tgaccttacc tggactgagg cagctgttga agggaattgc tgtgttcagt 20640 

gtatactgcc atccatgatt tcatgaaacc agctctagct atttaagcag gggtcaaact 20700 

tagaattcta cattattttt ttcccttttc tgggaggaaa gacagttgaa caccagcaaa 20760 

gactaagaaa tttcttagaa gactgtgggt ccttgggccc tttctattga atttcagagt 20820 

atttccaaat actatgaagt cttgcagctt agttgagaaa tgccccagat ggtgtgacat 20880 

tctgcttcca ggagggattg gaaagtattt ccttttacat aacattccac tcagctcatt 20940 

cctttgctgt gtctgaaatt gaatccccca aagccacaat tatcttaaca ttcagaagag 21000 

tgtttattta atctgcaaaa tcttgcctca cttttgggga gcatgttaac aatttcactt 21060 

acaaatcttc tgtgtciactc aaccccatgg tggtgtctac tgctgctcct agactcttta 21120 

aagcaccttt ctcatctcag gtttgaaatg atatgtctca ttcttgggtt ccttgagtcg 21180 

taatgggttt gtcttgtctc cacagcataa atgactcttt cttgatcaac tagaaccaca 21240 

tcaacttctt ccctccagct tcagtgatat attgtgaaac atggctattc aacgtcctgt 21300 

agaccaaatg ccataagaaa aatagcattg attcaaacgt atccatccag atacctaaaa 21360 

aagttttact tcttaccaca tcttgagtct gggcaaacac gcacttccta tggacattga 21420 

ttactgtcta ctgtagagat aacatttgca catacagatt atggcacatg gtagaaagtg 21480 

ttaagtaatg taggaatgga catatcccaa gcaaaattgg aagccaagtc ccctgtccct 21540 

gctcaagttg gtatgactgg tgtatggtgc cttaatgggt acttaaagtc caggtgagag 21600 

tggcaggagg cagccaaatg cctaggtaga taggagccgg tccctgttga aaccccactt 21660 

ccaagttgaa gacagtttaa agactgaaag ccaagctaca agttaaatcc tcggaccaga 21720 

ttgagaactt gtcttcttac ttggtgcact cttctgattg atccccacct ttcacctatt 21780 

ttacatattc ctgcccttcc ctaactggtt tcctatgctg tcatgcccac ctttgagtgt 21840 

tgccttcact ttaaccttct gtgcatgctc acaaagtaat tagcatgtac cctccattct 21900 

gagttaatat ciaggccccag acccagccac atggggcaac tttactgcct tcaggtaggg 21960 

gaaccacccc caccacattc cctctccact gagagttttc cttttagtta ataaattcgg 22020 

ctccactcac tctccattgt ctgcatgcct aattcttcct ggttgtgaga ceiagcagttg 22080 

gacctagctg agctaaggag cagaciagact gtatcacagg gaacttgtgt aacagcttga 22140 

tctcctgtcc tacgtagcta tctattggta agaagttgcia ggaacttgtg tcattccgtt 22200 

gtgcctgtcg tcttgacctt gtaaaaggtc ttgggtaagc atgcaagaag ttttgaagag 22260 

ggagatacag ctaatttgca gataaagagc aagggaagaa ttcctggaga aaggaagagt 22320 

ttcctgagtc acctttggga ggtaggaagg gtttgacatg taagctgggc atctgggaac 22380 

gagtgaggga ttctgtgagc cccatctcag tggaccactc aaggaaggtg ggtaagccct 22440 

gggtaataag tgtgtaagca gggaacagaa agtactgtga tttaaaatat gttaattttt 22500 

ctactgtaca gatgagagcc agcttggaga tgggctgtag ctcaagcatc ttacctacct 22560 

ctgatttctt aatgccacgt tataaggctg ctgcttatag ctcttgaagt cactccaaaa 22620 

acagatgagt gagaccctgt tgctaaagtc ccactgggtg tagattattc acagatgtat 22680 

acacagtggc tcactccagg taggatgtga tcagtgcttt tagaaataca gaaagtccta 22740 

ttggtttaaa aaaaattttt ttttgtaatg aattgagttt taaagctagc actgtacaat 22800 

aaaagggtga atttcactat gaattatgac aaacacagtc atagagctgc catcatcact 22860 

gtcaagatac agaacagtgc catcaccccc caaatgtcct ctgtgcctta ctg tag teat 22920 

acctgctcct gacacctagc ccctgggaac cattgatctt ttttctgtcc ctagttgttt 22980 

gtcttctctg caatgttaca taaacagaaa tattctgggt gtcgcttttg gagtttggat 23040 

tccttcccat agcataatgc atgcagtatt tgtctgtttc tgctciaatta cccccaagac 23100 
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ctagcgactt 
taataaatCa 
ggttaaagat 
gctttctaga 
ccacaacatc 
ccctctagcc 
gattatctag 
catgccatgt 
tcatgggggt 
acatgatatc 
ctcccctttt 
catgcttctt 
tccagcctca 
cttgatcacg 
gggttaggat 
tcaggaattt 
atcaagatgt 
gaggtcgagg 
tgaaacgcca 
cagttagttg 
caactgagat 
aaaaaaaacia 
ggacccaaca 
agctataatt 
tcatggggca 
agaacaggag 
atgatttttt 
gtgaggggat 
accctgccta 
ttctttttaa 
gagggcteiat 
aaaccttgaa 
taatcaagtg 
aaggagaagg 
tgtgtttaat 
attagaatag 
tgggagaatg 
gaacagagct 
gctgagcctg 
gagcggagag 
gggagctctg 
tgtgggggtg 
ggagttcatg 
tttaggggtt 
gaagatggga 
atcagccatt 
aagagtttag 
actgacaggt 
tttgaagtct 



aaaacagcaa 
ccacaaactt 
ttgtcatggt 
gactccaggg 
ttcaaaacca 
acaatcggga 
catgagctcc 
tgaaacataa 
ggaaccctca 
tggcaccttc 
gccttccgcc 
gtagagtctg 
gttacttctt 
gtctgcagca 
gtggacttct 
agggacagtt 
cagctggggg 
tgagtggacc 
tactaaaagt 
gggggctaag 
cacaccactg 
aaaaaaaaag 
tcatgcatgg 
cctgactatt 
gttgataaca 
tgatacagaa 
ctatattctg 
tacacaaaag 
ggacagtgcg 
tgttgagtaa 
caacgtgttg 
gaacaattag 
aggggcgtac 
gggtgtagtt 
ttagcagaca 
tattttaata 
tagaggtgac 
agaaacatgg 
ggatggggat 
ggaagcaggc 
gagaaggaat 
atgacttccc 
ctccagaggt 
cctaatgagc 
tagcacattg 
acagaaataa 
aCttaggcac 
gggaatgaga 
tcatggcgat 



acatacacta 
atgtgcttaa 
tctcattggg 
agaatatttt 
gcactgttgc 
aaggatctca 
ctgtctcaag 
tccctagtgt 
tgcatgactt 
cttcctctgt 
atgactgtaa 
cagaatcgga 
tattgcagtg 
tgtcttttac 
ttgggggact 
tggctggttg 
ctgggcatgg 
atttgaggtt 
acaaaaatta 
gctggagaac 
cactccagcc 
ttagctgggg 
ctgttggcag 
gaccagaggc 
cagcagttag 
gccagcatct 
ttagttagaa 
tatgaaaacc 
Cttgagatcc 
tatgccattg 
attagaggga 
gaatttgaca 
tcatccatgg 
gtggctgttg 
tttaggaacc 
agtttaacta 
aagaccaggt 
agtgacatga 
tcttgttcct 
tttggtttct 
tgtatcaccg 
agatgtgagg 
agtgcctacc 
ctagagatac 
ttttcctaag 
agggtatcag 
gttgaccctg 
ttctggaact 
tttgttcgga 



tctcac^ctg 
aacccaaagc 
ctaaaattaa 
tccttgcctt 
atctctgacc 
ggactgttgt 
gtgatagagt 
tggcggtggg 
acggccatcc 
tgctcttgcc 
gcctcctgag 
gccaattaaa 
caagaatagc 
catgtaaggt 
tttacttttc 
tttctggccc 
tggctcactc 
aggagtttga 
gttgggcatg 
agcttcaaaa 
tgggtgaaag 
ccaaggfccat 
aggtcacttc 
gaacctcagt 
cttccattgg 
ttttgtaaac 
acaaattcct 
aggciagcagg 
atgttattct 
tgtgtttata 
aattttcttc 
gagggaatgg 
ggcagtggat 
ctcagaagga 
atCaaaagtt 
ggctttaacc 
gtgggctatt 
agtcattgtc 
atgatcgatt 
actggaaatc 
agttggtcct 
gtgccctcca 
tctgtggaga 
tgacattgac 
tatttgtaat 
gacggggagg 
aatttgtggg 
tggaaagaat 
gtcatggatt 



ttagttttcc 
tattatctga 
ggtgtcggta 
ttccagcttc 
ttagccctgt 
gatgacactg 
ttgcatgttt 
tgtgctggga 
ctttggtgat 
ctcaccatgt 
gcctcaccgg 
tctcttttca 
ttaacacaca 
aatatgttca 
ccagttacta 
agggtctttc 
ctgttatcac 
gaccaacctg 
gtggcacatg 
caggaggcgg 
agcaaggccc 
ctcaagtctt 
ctcatgagca 
tccctgccat 
attgagtaag 
caatctcaga 
aggtctcacc 
gagcattggg 
atccattagt 
tattgctaat 
agtgaaataa 
cacgaataaa 
gattcacctg 
cctgaatgtc 
tttgagaatg 
agCattcata 
gcagtgacta 
ttaggttggt 
aggggactgt 
tgtttccgcc 
ccttgtcaac 
gcaatggaca 
agtaggggat 
cagtcatgga 
caggtgactg 
agatgtgagg 
aaggtatcca 
atagtggcta 
tggciagaaac 



attgctgttg 
cagctctgta 
tggctctgct 
taaaggtcac 
agtcccattt 
tgcttaccta 
ttctcctgca 
ggtatttgga 
aagtgagttc 
tagctgccta 
aagccaagca 
ttataaatta 
aggtcttacc 
gtggctgtgg 
tttttgtgac 
ttgggctgca 
agcactttgg 
gccaacatgg 
catgtaatcc 
aggttgtggt 
cacctcaaaa 
gactatgaaa 
ttaciactgaa 
gtgggcctct 
caagagagca 
agaattgtcc 
cacccttgag 
agtcatttgg 
agtagtttgc 
ggattttaaa 
tatttgagca 
gacccagact 
cctggagctt 
aggatgcatt 
ggtaggtaag 
aaaagccaag 
cacctaaaag 
gatcctagca 
tcccagagga 
cgattccatg 
agagcatgtg 
gttatctgga 
ggcacggtat 
gactaaagga 
gtttgcaaat 
agagaagatg 
ggaagataag 
gagctaaggg 
tgctgcaggg 



23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
23940 
24000 
24060 
' 24120 
24180 
24240 
24300 
24360- 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
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